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1.  Introduction 


Nonpar axnetric  tests,  based  on  ranks,  have  proved  useful  in  a  variety  of  applications. 
They  are  often  preferred  to  normal  theory  procedures,  since  rank  tests  require  few  assump¬ 
tions  about  the  underlying  distribution  generating  the  data.  An  examination  of  the  book 
of  Hollander  and  Wolfe  (1973),  for  example,  reveals  that  rank  tests  have  been  developed  for 
a  multitude  of  hypothesis  testing  situations,  including  the  two-sample  and  multi-sample 
location  problems,  the  two-way  layout  problem,  the  one-sample  location  problem,  the  two- 
sample  dispersion  problem  with  equal  medians,  and  problems  of  testing  for  trend  and  for 
independence. 

One  shortcoming  of  the  present  theory  of  rank  tests  is  that  such  tests  have  usually 
been  constructed  on  a  case-by-case  basis,  in  a  quite  ad  hoc  (albeit  clever)  manner.  They 
depend  strongly  on  the  particular  testing  situation  under  consideration,  and  often  do  not 
extend  readily  to  other  hypothesis  testing  problems.  The  work  of  Hijek  and  Sidak  (1967) 
addresses  this  issue  in  some  special  cases,  and  contains  numerous  statements  such  as  the 
following  (page  95):  “from  the  intuitive  point  of  view,  the  [Ansari-Bradley]  test  [of  differ¬ 
ence  in  scale]  appears  to  be  an  analogue  of  the  Wilcoxon  test  [for  the  two-sample  location 
problem].”  Moreover,  Hajek  and  Sidak  (page  98)  describe  the  Siegel-Tukey  (1960)  method 
of  converting  tests  of  location  into  tests  of  dispersion,  and  also  discuss  the  literature  on  ex¬ 
tending  two-sample  location  tests  to  the  multi-sample  problem  (pages  106-108).  However, 
there  is  no  truly  general  procedure  for  extending  an  arbitrary  rank  test  from  one  testing 
situation  to  another. 

This  paper  introduces  a  general,  yet  simple  construction,  which  produces  many  test 
statistics  for  all  of  the  hypothesis  testing  situations  mentioned  in  the  first  paragraph. 
Indeed,  the  construction  gives  rise  to  the  following  familiar  rank  test  statistics: 

(l)  the  Mann- Whitney  statistic,  the  Wilcoxon  statistic,  the  Mood  “median  test”  statis- 
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tic,  the  Kolmogorov-Smiraov  statistic,  and  the  Wald-Wolfowitz  statistic  for  the  two-sample 
location  problem; 

(2)  the  Jonckheere-Terpstra  statistic  for  the  multi-sample  location  problem  with  or¬ 
dered  alternatives; 

(3)  the  Page  test  statistic  for  the  two-way  layout  with  ordered  alternatives; 

(4)  the  Wilcoxon  signed  rank  test  statistic  and  the  Fisher  sign  test  statistic  for  the 
one-sample  location  problem; 

(5)  the  Ansari-Bradley  statistic  and  the  “quartile  test”  statistic  for  the  two-sample 
dispersion  problem  with  equal  medians. 

In  addition,  the  construction  gives  rise  to  many  less  familiar,  yet  equally  plausible  test 
statistics.  The  author  is  encouraged  by  these  results,  and  is  therefore  hopeful  that  this 
novel  approach  may  provide  the  basis  for  a  new,  more  unified  study  of  rank  tests. 

2.  The  Basic  Construction 

The  proposed  construction  uses  distances  between  permutations,  and  between  equiv¬ 
alence  classes  of  permutations.  Such  distances,  or  metrics,  have  recently  enjoyed  increased 
attention  in  the  statistical  literature.  Several  well  known  metrics  on  permutations  are 
presented  and  discussed  briefly  in  Section  3;  they  will  prove  essential  to  our  study  of  rank 
tests. 

The  problem  of  constructing  a  nonparametric  rank  test  has  the  following  general 
formulation.  Given  two  hypotheses  Ho  and  Hi,  one  wishes  to  develop  a  nonparametric 
test  of  Ho  versus  H\ ,  on  the  basis  of  an  observed  ranking  (or  permutation)  of  n  individuals 
(or  items) .  In  this  paper,  attention  is  concentrated  only  on  the  problem  of  finding  a  suitable 
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test  statistic,  rather  than  on  the  closely  related  (and  important)  topic  of  the  distribution 
of  this  statistic  under  Ho- 

As  a  simple  example,  let  the  hypothesis  H\  be  that  women  perform  better  than  men 
on  a  certain  test,  and  Ho  is  that  they  do  not.  The  available  data  are  the  test  scores 
of  a  group  of  ni  men  and  n j  women,  ni  ■+■  nj  =  n.  If  normal  distribution  theory  is  not 
applicable,  one  wants  to  find  a  nonparametric  test  statistic  for  Ho  versus  H 1,  using  only 
the  ranks  of  the  n  individual  scores. 

The  proposed  general  solution  to  such  problems  —  whether  they  be  the  two-sample 
location  problem  illustrated  above,  or  some  other  nonparametric  testing  situation  —  is 
based  on  the  distance  between  two  sets  of  permutations.  It  is  implemented  in  the  following 
steps: 

Step  1:  Collect  all  data  relevant  to  the  problem,  and  rank  order  all  of  these 

observations.  This  produces  a  single  permutation,  which  we  denote  by  ir.  (To  simplify  the 
presentation,  we  will  assume  throughout  that  all  data  come  from  continuous  probability 
distributions,  so  that  there  are  no  ties  in  the  permutation  7r.) 

Step  2:  Identify  the  set  of  permutations,  which  (for  the  testing  problem  at  hand) 
are  equivalent  to  the  observed  permutation  n.  This  is  an  equivalence  class  of  permutations, 
which  we  denote  by  [tt]. 

Step  3:  Identify  the  set  E  of  extremal  permutations,  consisting  of  all  permutations 
which  are  least  in  agreement  with  Ho,  and  most  in  agreement  with  H\. 

Step  4:  To  test  Ho  versus  H 1,  let  d  be  a  metric  on  permutations,  and  consider  the 
minimum  pointwise  distance  between  the  sets  in  Steps  2  and  3: 

d(M,  E)  =  min  d{a,0). 

peE 
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This  is  the  proposed  test  statistic. 

Since  d([x],i?)  measures  the  distance  from  [tt]  to  the  set  of  permutations  which  are 
most  in  agreement  with  H\t  it  follows  that  one  wants  to  reject  Ho  for  small  values  of 
d([jr],.E).  This  contrasts  with  the  form  of  some  parametric  tests,  where  one  computes 
the  distance  from  Ho,  and  rejects  Ho  if  this  distance  is  large.  However,  as  illustrated 
especially  by  the  trivial  case  of  testing  for  trend  (Section  12),  measuring  the  distance  from 
Hi  is  really  the  only  natural  way  to  proceed  in  the  nonparametric  situation. 

Another  immediate,  and  intuitively  appealing,  property  of  the  proposed  test  statistic 
d([?r],.E),  is  that  d([ir],E)  attains  its  extremal  value  (zero)  if  and  only  if  the  observed 
permutation  x  is  equivalent  to  some  extremal  permutation  e  e  E.  In  other  words,  the 
strongest  evidence  for  rejecting  Ho  occurs  if  and  only  if  x  is  equivalent  to  some  e  e  E.  This 
fact  follows  trivially  from  the  basic  metric  property  that  d(x,a)  =  0  if  and  only  if  x  =  a. 

Mathematically,  the  sets  of  permutations  in  Steps  2  and  3  can  often  be  given  group- 
theoretic  descriptions.  The  group  theory  provides  a  convenient  notation  for  these  sets, 
and  it  simplifies  the  derivations  of  the  induced  test  statistics,  but  it  does  not  appear  to  be 
absolutely  essential  to  the  theory. 

Although  the  basic  construction  is  rather  abstractly  formulated,  it  gives-  rise  to  con¬ 
crete  results.  Indeed,  as  discussed  below,  this  procedure:  (1)  works  in  a  variety  of  testing 
situations;  (2)  gives  rise  to  many  familiar  nonparametric  test  statistics;  (3)  produces  sev¬ 
eral  other  test  statistics  which  are  less  familiar,  yet  equally  plausible;  and  (4)  enables  us 
to  extend  our  tests  to  other  hypothesis  testing  situations.  Moreover,  the  tests  produced 
by  this  procedure  have  nice  statistical  properties,  which,  in  turn,  suggest  possible  future 
areas  of  investigation. 
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3.  Some  Statistically  Meaningful  Metrics  on  the  Permutation  Group 

In  Section  2  a  method  of  generating  rank  tests  was  described  briefly,  which  uses  metrics 
on  permutations.  Before  giving  a  detailed  illustration  of  this  methodology  in  Section  4, 
it  will  be  helpful  to  list  several  especially  important  metrics,  and  to  introduce  a  suitable 
notation  for  these  metrics. 

To  any  ranking  of  n  items,  there  corresponds  an  element  ?r  of  the  permutation  group 
Sn.  Thus  ir:  {l,...,n}  — *  n}  is  a  function.  We  make  the  important  convention 

that  7r(t)  is  the  rank  assigned  to  item  i. 

Diaconis  (1982)  lists  the  following  metrics  on  Sn,  which  are  among  the  most  widely 
used  in  applied  scientific  and  statistical  problems: 

K(ir,a )  =  number  of  pairs  of  items,  (:,  j),  such  that  ?r(t)  <  n(j)  and  o(t)  >  o(j) 

=  minimum  number  of  pairwise  adjacent  transpositions  of  items  needed  to 
transform  ir  into  a 
is  Kendall’s  tau; 

n 

F(ir,<?)  =  Y  K(0  ~  tf(OI  Is  Spearman’s  footrule; 

»'«i 

(tt(0  —  <t(i))2)1^2  is  Spearman’s  rho; 

H (tt,  a)  =  #  {i  =  1, . . . ,  n:  tt(i)  yt  <7(1) }  is  Hamming  distance; 

U(n,o)  =  n—  the  length  of  the  longest  increasing  subsequence  in  .  ,a‘K~1(n) 

=  n—  the  maximal  number  of  items  ranked  in  the  same  relative  order  by  ?r  and  a 
is  Ulam’s  distance. 

In  the  preceding  definitions  of  the  metrics,  a  couple  of  points  require  clarification: 

(1)  For  any  finite  set  f2,  denotes  the  number  of  elements  in  fi.  Thus,  Hamming 
distance  is  simply  the  number  of  items  which  are  assigned  different  ranks  by  the  two 
permutations. 
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(2)  The  more  commonly  used  forms  of  Kendall’s  tau,  Spearman’s  footrule,  and  Spear¬ 
man’s  rho  are  obtained  by  replacing  the  metric  d  by  1  —  2 d/M,  where  M  —  M{d)  — 
max  d(r,a).  This  has  the  advantage  of  renormalizing  the  possible  values  to  lie  between 
— 1  and  1,  so  that  they  can  be  interpreted  in  roughly  the  same  way  as  a  correlation  coef¬ 
ficient.  However,  for  the  purpose  of  constructing  nonparametric  tests,  we  will  work  with 
the  original  metric  versions  d,  rather  than  the  transformed  versions  1  —  2d/M.  Indeed, 
our  whole  approach  is  based  on  using  d  as  a  measure  of  the  distance  between  two  sets  of 
permutations.  The  prevailing  practice  of  renormalizing  these  metrics  may  explain  why  a 
metric-based  approach  to  rank  tests  has  not  been  previously  explored  (at  least,  not  in  the 
context  of  the  present  paper). 

Kendall’s  tau  and  Spearman’s  rho  are,  of  course,  widely  used  nonparametric  “measures 
of  association”  for  two  rankings.  Spearman’s  footrule  has  recently  enjoyed  renewed  interest 
(Diaconis  and  Graham  (1977),  Feigin  and  Alvo  (1986)).  Hamming  distance  is  utilized 
in  coding  theory  to  measure  the  discrepancy  between  two  binary  sequences  (Berlekamp 
(1968),  Cameron  and  Van  Lint  (1980)).  Ul&m’s  metric  is  used  in  DNA  research  to  measure 
the  distance  between  two  strings  of  molecules  (Ulam  (1972,  1981)),  and  has  also  been 
investigated  in  the  statistical  literature  as  a  nonparametric  measure  of  association  (Gordon 
(1979)). 

Diaconis  (1982)  has  a  beautiful  and  enlightening  discussion  of  these  metrics  on  the 
permutation  group.  This  discussion  is  summarized,  and  extended,  in  the  recent  monograph 
of  the  present  author.  In  particular,  see  Critchlow  (1985)  for  a  more  complete  description 
of  these  metrics  and  their  defining  properties  (pages  5-11),  and  for  a  host  of  additional 
applications  of  these  metrics  to  a  different  collection  of  statistical  problems,  involving  fully 
ranked  and  partially  ranked  data  (pages  97-129).  Finally,  for  further  related  work,  see  the 
paper  of  Feigin  and  Alvo  (1986)  on  a  metric-based  approach  to  comparing  two  populations 
of  rankers,  using  Rao’s  (1982a,  1982b)  apportionment  of  diversity;  and  see  the  papers  of 


6 


Mallows  (1957),  Feigin  and  Cohen  (1978),  and  Fligner  and  Verducci  (1985)  on  metric-based 
ranking  models. 

4.  The  Two-Sample  Location  Problem,  with  Ordered  Alternatives 

The  basic  construction  will  be  illustrated  now,  in  detail,  for  the  two-sample  location 
problem  with  ordered  alternatives.  In  particular,  we  will  proceed  methodically  through 
the  four  steps  described  in  Section  2,  and  will  discuss  the  implementation  of  each  step  for 
the  problem  at  hand. 

Recall  the  example  in  which  the  data  are  the  test  scores  of  ni  men  and  n2  women, 
fix  -+■  n2  =  n.  Let  F\  and  F2  be  the  (population)  distribution  functions  for  men’s  and 
women’s  scores,  respectively.  For  the  two-sample  location  problem,  the  null  hypothesis  is 
H0:  Fi(x)  s  F2(x)  (i.e.  men  and  women  perform  equally),  and  one  possible  formulation 
of  the  alternative  is  Hi:  Fi(x)  >  F2(x),  with  strict  inequality  for  some  x  (i.e.  women 
score  higher). 

Step  1:  The  relevant  permutation  it  is  the  observed  rank  ordering  of  all  n  individuals. 
We  make  the  following  conventions:  individuals  1, . . . ,  and  n.i  4- 1, . . . ,  ni  4-  n2  axe  from 
the  first  population  (men)  and  second  population  (women),  respectively.  (This  arbitrary 
labeling  of  the  individuals  can  be  performed  without  any  loss  of  generality,  because  all  of 
the  metrics  under  consideration  are  right-invariant ;  that  is,  they  satisfy  d(it,c)  =  d(irr,  or) 
for  all  it,  a,  reSn.  See  Diaconis  (1982),  Critchlow  ((1985),  pages  10-11),  or  the  Appendix 
of  the  present  paper  for  a  discussion.)  / 

Also,  recall  ir(t')  is  the  rank  given  to  individual »,  with  low-numbered  ranks  (i.e.  ranks 
1,  2,  3,  etc.)  corresponding  to  low  scores.  (In  general,  in  the  future,  we  will  continue  to 
follow  the  convention  that  “items  are  ranked  from  least  to  greatest.”)  This  completely 
specifies  it  as  a  member  of  the  permutation  group  Sn. 
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Step  2:  For  the  two-sample  location  problem,  a  permutation  a  is  “equivalent”  to 
the  observed  permutation  ir  if  and  only  if  it  assigns  the  same  set  of  ranks  to  population  X, 
i.e.  if  and  only  if  a{l, . . . ,  ni}  =  7r{l, . . . ,  nj}.  This  definition  is  justified  formally  by  the 
theory  of  maximal  invariants  (Lehmann  (1959)):  for  this  problem,  the  maximal  invariants 
are  the  set  of  ranks  assigned  to  population  1;  a  is  equivalent  to  rr  if  and  only  if  it  yields 
the  same  values  of  the  maximal  invariants.  The  intuitive  idea  is  that  any  such  a  provides 
the  same  information  as  n  for  the  testing  problem  at  hand. 

The  equivalence  class  [?r]  is  thus  a  subset  of  Sn,  consisting  of  nilr^!  permutations.  In 
group-theoretic  notation,  [tt]  is  the  left  coset  t Sni  x  S„a,  where  Snt  *  Sn2  is  the  subgroup 
{aeSn:  a[i)  <  n.j  V  i  <  ni  and  <r(i)  >  ni  V  i  >  n i}. 

Step  3:  The  extremal  set  E  consists  of  all  permutations  which  are  most  in  agreement 
with  H\,  and  least  in  agreement  with  Ho-  Intuitively,  these  are  the  permutations  which 
rank  all  individuals  in  the  first  population  before  all  individuals  in  the  second  population. 
Pictorially,  E  consists  of  all  rankings  of  the  form 


i»i  n7 


where  X's  and  O’s  denote  individuals  from  populations  1  and  2,  respectively.  Mathemat¬ 
ically,  E  is  the  subgroup  Sn,  >'  Sna  described  in  Step  2. 

Step  4:  The  test  statistic  for  Ho  versus  H\  is  given  by  the  minimum  interpoint 
distance  between  the  sets  constructed  in  Steps  2  and  3: 

d{\ir),E)  =  min  d{a,0). 

Ot€TSn  j  X 


It  remains  to  describe  these  test  statistics  explicitly,  for  various  choices  of  the  metric  d  on 


Recall  that  two  test  statistics  are  equivalent  if  one  can  be  obtained  from  the  other 
by  a  monotone  transformation.  The  following  interesting  result  says  that  the  test  statis¬ 
tics  induced  by  the  metrics  of  Section  3  axe,  in  fact,  equivalent  to  some  well  known  test 
statistics. 

To  state  the  result,  we  introduce  the 

Notation:  Let  Ni  and  N2  be  the  sets  N2  —  {1, . . . ,  nx}  and  N2  =  {nx  + 1, . . . ,  ni  + 
n2}.  Thus  N\  and  N2  axe  the  sets  of  labels  for  the  items  in  the  first  and  second  populations, 
respectively. 

Let  a\  <  ...  <  ani  be  the  ranks  assigned  by  it  to  the  first  population;  that  is, 
{ai,...,ani}  is  the  set  (7r(l), . . . ,  jr(ni)}  listed  in  ascending  order.  Similarly  let  ani  +  x  < 
. . .  <  ani  +na  be  the  ranks  assigned  by  it  to  the  second  population. 

For  *  =  1, . . , , »  let 

Ui  =  ir(j)  <  t}, 

ti2  =  *•  -  tn  =  #{jtN2:  *ti)  <  *}• 

Thus  tn  is  the  number  of  items  from  the  first  population,  which  fall  among  the  top  i  ranks, 
while  ti 2  is  the  corresponding  number  for  the  second  population. 

Finally,  let  Fi(x)  and  F2(x)  be  the  empirical  cumulative  distribution  functions  for  the 
first  and  second  populations,  respectively. 

Theorem  1:  The  test  statistics  for  the  two-sample  location  problem  with  ordered 
alternatives,  induced  by  the  metrics  of  Section  3,  axe: 

K{itSni  x  S„3,Sn,  x  Sn,)  =  #{(*', j)cNi  x  N2:  jt(»)  >  tt(j)}. 

This  is  the  Mann-Whitney  (1947)  test  statistic. 

F{itSn j  x  Sn3,Sni  x  Sn^)  =  2  'y  ^  —  ^1(^1  +  !)• 

t=i 
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This  is  equivalent  to  the  Wilcoxon  (1945)  test  statistic  jr(t). 

»=i 

x  S„„Sni  x  S„.)  =  £>  -  -  2 

i«=l  i=l 

n 

This  is  equivalent  to  the  test  statistic  £2  *<*»• 

«=i 

B (xSjij  x  Sna ) 5jh  x  S„a )  — -  2#({x(l),. . . ,  Tr(fti)}  n  {ni  4*  +  **2}) 

=  2#(*(tf1)njv2). 

For  equal  sample  sizes  (rti  =  n2  =  n/2) ,  this  is  equivalent  to  the  Mood  (1950)  “median 
test”  statistic  =  #{n{Ni)  D  iST2)  =  #[ir{N\)  n  {n/2  +  l,...,n}). 

U[itSni  x  > Sna,Sni  x  Snj)  =  n^  —  max{t*i  ”t»2}. 

t 

For  equal  sample  sizes,  this  is  equivalent  to  the  Kolmogorov-Smimov  test  statistic 

“  “f*  { £  '  £ } = i h' (l) "  A(l) ) ■ 

Proof:  The  derivations  of  these  results,  and  of  the  corresponding  results  in  succeed¬ 
ing  sections,  are  deferred  to  the  Appendix.  We  simply  remark  here  that  some  elementary 
group-theoretic  and  combinatorial  properties  are  helpful  to  the  arguments.  □ 

Remarks:  It  is  well  known  that  the  Mann- Whitney  test  statistic  and  the  Wilcoxon 
test  statistic  are  equivalent.  Yet  they  are  induced  by  different  metrics  —  namely,  K  and 
F,  respectively.  This  is  not  particularly  surprising,  since  it  is  entirely  possible  for  two 
different  metrics  to  produce  the  same  minimum  interpoint  distance  between  the  sets  [x] 
and  E. 

The  test  statistic  induced  by  Spearman’s  rho  has  not  appeared  previously  in  the 
literature,  as  far  as  the  author  is  aware.  However,  it  bears  a  strong  resemblance  to  one 
form  of  the  Cramer-von  Mises  test  statistic 

*1  (a*  ~1’)2  +  n2  YL  («i-(*“ni))2 

t=i  »=m  +  i 


S 


>V.  -%  V .%W_%W_vV.*  sV.*  V  */  C  *: 
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(Hajek  and  Siddk  (1967),  page  93),  and  also  to  Page’s  (1963)  test  for  the  two-way  lay¬ 
out  with  ordered  alternatives.  This  latter  resemblance  is  not  a  coincidence,  as  Section  8 
demonstrates. 

Of  course,  although  the  test  statistics  of  Theorem  1  are  among  the  most  well  known, 
several  other  rank  test  statistics  have  been  proposed  for  the  two-sample  location  problem. 
As  discussed  in  Sections  13.2  and  13.3,  nearly  all  such  statistics  are  induced  by  our  basic 
construction,  by  suitable  metrics  on  permutations. 

*  *  * 

The  basic  construction  carries  over  to  other  hypothesis  testing  situations.  As  a  result, 
the  construction  yields  not  just  a  collection  of  test  statistics,  but  also  a  collection  of  families 
of  test  statistics.  For  example,  since  Ulam’s  metric  U  induces  the  Kolmogorov-Smirnov 
statistic  for  the  two-sample  location  problem  with  ordered  alternatives,  we  may  regard 
the  statistics  induced  by  U  in  other  situations  as  natural  extensions  of  the  Kolmogorov- 
Smirnov  statistic. 

For  each  testing  situation,  the  equivalence  class  [?r]  and  the  extremal  set  E  must  be 
identified,  before  computing  the  minimum  interpoint  distance  between  these  two  sets. 

5.  The  Two-Sample  Location  Problem,  with  Unordered  Alternatives 

For  the  two-sample  location  problem  with  unordered  alternatives,  the  equivalence 
class  [tt]  =  7rSni  x  is  the  same  as  for  the  ordered  alternatives  case.  However,  the 
extremal  set  E  must  be  expanded  to  include  not  only  those  permutations  which  rank  all  of 
population  1  before  population  2,  but  also  those  permutations  which  rank  all  of  population 
2  before  population  1.  In  group-theoretic  notation,  E  =  52  O  ( Sni  x  S„a),  where,  as  in 
Section  4, 

S„,  x  5„a  =  {<7cS„:  <r(t)  <  o{j)  V  teJVj  V  j'eiVj}  =  {atSn\  cr[Nx)  <  o{N2)}, 
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and  where  the  “dot  product"  52  ©  (Sni  x  Sna)  is  defined  as  {0  £  Sn:  3  at  S2:  0{No(i))  < 
2))}-  (Here  and  henceforth,  the  notation  Cl\  <  fl2,  where  nx  and  02  are  sets  of 
numbers,  means  that  u>i  <  w2  for  all  wj  £  Hi  and  w2  t  fi2.) 


Our  basic  construction  now  gives  rise  to  the  usual  two-sided  versions  of  the  test  statis¬ 
tics  presented  in  Section  4: 

Theorem  2:  The  test  statistics  for  the  two-sample  location  problem  with  unordered 
alternatives,  induced  by  the  metrics  of  Section  3,  are: 

K(*s„,  x  S„„S,  0  S„.  X  S„.)  =  -  |#{(i,j)tAr,  x  N2:  »(.')  >  »(>)}  -  2£l|. 

This  is  equivalent  to  the  Mann- Whitney  (1947)  two-sided  test  statistic  =  \#{{i,j)tNx  x 
N2:  *(0  >  Jt(j')}  -  nin3/2|. 

F(xS»,  X  S„„S,  ©  S„,  X  S„,)  =  n,n,  -  2|  f'  *(i)  -  |. 

tail 

This  equivalent  to  the  Wilcoxon  (1945)  two-sided  test  statistic  =  (  £  r(t)  —  ni(»+ 1)/2|. 

im  1 

(^^nj  x  Sn7,S2  ©  x  Sna) 

n.(n  +  l)(2n  +  1)  ^  .  f  ^ 

= - - - 2  2^  »a. -2  max^0,n2  2^jt(i)  -  m  2w  MO  (  • 

<■1  l  t*l  »*», +1  J 

This  is  the  two-sided  version  of  the  new  test  statistic  introduced  in  Section  4. 


H(xSni  x  Sna ,  S2  ©  Sni  x  S„a)  =  2  min  {#(jt(JVx)  n  W2), #(sr(JVTx)  n  {1, . . . ,  n2»> . 


For  equal  sample  sizes  (nx  =  n2  =  n/2),  this  is  equivalent  to  the  Mood  (1950)  two-sided 
“median  test”  statistic 


-l#(»W)n{|  +  i . 


C7(rS„,  x  Sn,,S3  ©  S„,  x  S„,)  =  ^  -  mix  —  +  t„  - 
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For  equal  sample  sizes,  this  is  equivalent  to  the  Kolmogorov-Smimov  two-sided  test  statistic 

=  max  j—  -  — |  =  max|Fi(z)  -  F2(*)|- 

*  Til  n2  x 

Proof:  See  the  Appendix.  □ 

6.  The  Multi-Sample  Location  Problem,  with  Ordered  Alternatives 

This  is  a  straightforward  generalization  of  the  two-sample  situation  considered  in 
Section  4.  Suppose  we  have  samples  from  r  >  2  populations,  with  sample  sizes  n1? . . . ,  nr, 

r 

where  £  nj  —  n.  Let  Fj  be  the  cumulative  distribution  function  for  the  j’-th  population. 
y*i 

The  null  hypothesis  is  H0:  Fx(x)  s  ...  5  Fr(x),  and  one  possible  formulation  of  the 
alternative  is  Hi’.  Fi(x)  >  F2(x)  >  . . .  >  Fr(x),  where  each  inequality  is  strict  for  some 
x. 

We  proceed  to  implement  Steps  1  through  4  of  the  basic  construction. 

Step  1:  The  permutation  ?r  is  the  observed  rank  ordering  of  all  n  items,  with  the 
conventions:  items  1, . . . ,  nj  are  from  the  first  population,  items  ni  +  1, . . . ,  ni  +  n2  are 
from  the  second  population,  and  so  on.  Let  Ni,Nif...,Nr  be  the  corresponding  sets  of 
labels: 

Ni  — 

N7  —  {ni  +  1,...  ,ni  +  n2} 

Nr  =  {tii  -j- . . .  +  nr_x  +  1, . . . ,ni  +  . . .  +  Tir_i  +  Tir}. 

Step  2:  The  equivalence  class  [tt]  =  {ae5n:  a{Nj)  =  7r(A^,)V  j  =  1, ...,r}  consists 
of  all  permutations  a  which  assign  the  same  set  of  ranks,  that  n  does,  to  each  of  the  r 


populations.  Thus  [jrj  is  the  left  coset  ir Sni  x  ...  x  Snr,  where  Sni  x  ...  x  S»r  is  the 
subgroup  {<r  e  Sn-  c(Nj)  =  JV,-  V  j  =  1, . . . ,r},  or,  equivalently,  where  Sn%  x  ...  x  Snr  = 
{a  e  Sn:  a(Ni)  <  a(Ni)  <  ...  <  a{Nr)}. 

Step  3:  The  extremal  set  E  is  5ni  x  ...  x  Snr.  In  words,  E  consists  of  the 

permutations  which  rank  all  items  in  the  first  population  before  ail  items  in  the  second 
population,  which  in  turn  are  ranked  ahead  of  everything  in  the  third  population,  etc. 

Step  4:  To  present  the  minimum  interpoint  distances  between  the  sets  [ir]  and  E, 
it  is  necessary  to  further  generalize  our  previous  notation. 


Notation:  Let  a*  <  . . .  <  ani  be  the  ranks  assigned  by  tt  to  the  first  population, 
On1+i  <  . . .  <  Onj+na  the  ranks  assigned  by  ?r  to  the  second  population,  and  so  on. 

For  t  =  1, . . .  ,n  and  j  =  l,...,r,  let  e  Nj‘.  ir(k)  <  *}  be  the  number  of 

items  from  the  j-th  population,  which  fall  among  the  top  i  ranks. 

Let  Fj  (x)  be  the  empirical  cumulative  distribution  function  for  the  j-th  population. 


As— 1 

£)• 
j-  i 


Finally,  Y1  denotes  summation  over  all  ordered  pairs  (j,  k)  such  that  j  <  k  (and  not 
j<k 


Theorem  3:  The  test  statistics  for  the  r-sample  location  problem  with  ordered 
alternatives,  induced  by  the  metrics  of  Section  3,  are: 


K(irSni  x  ...  x  Sn,,5n,  x  ...  x  Sn,)  =  ^#{(t',m)  £  Nj  x  Nk:*{ t)  >  ”■("*)} 

;<As 


This  is  the  Jonckheere  (1954)-Terpstra  (1952)  test  statistic. 


Sn |  X  ...  X  SnrfSni  X 

R  (irSfii  x  ...  x  5n,i ^«i  ^ 
B[irSn  i  x  ...  x  Snr,Sni  x 

U(nSni  x...x  SflrlRni  x 


=  YL  lfl‘  ~  *1- 

«*)  =  £>  -  0*  -  n(,1  +  1),(:!"  +  1-)  -  2 1 


i-1 


«-l 


*  SO  =  »- nW,). 


i-1 


f-1 


x  5nJ  =  (n  -  nr)  -  max  ]£(««,/  -  «*,./+ 1) 

*1  S-S*r- 1  T“T 

;=i 


Proof:  See  the  Appendix.  □ 


Remarks:  The  Jonckheere-Terpstra  statistic  is  usually  viewed  as  the  extension  of 
the  Mann- Whitney  statistic  to  the  multi-sample  situation.  Similarly,  the  statistic  induced 
by  F  is  a  natural  extension  of  the  Wilcoxon  test  statistic,  and  the  statistics  induced  by  R2t 
H ,  and  U  are  natural  extensions  of  the  corresponding  statistics  in  Section  4.  In  particular: 

(1)  The  Mann- Whitney  and  Wilcoxon  tests  are  not  equivalent  for  more  than  two 
samples. 


(2)  For  equal  sample  sizes  from  r  >  2  populations,  the  statistic  induced  by  H  is 
equivalent  to  the  following  extension  of  Mood’s  median  test  statistic: 

r 

£  #(*(*,)  n jvj). 

i-i 


(3)  For  equal  sample  sizes  from  r  >  2  populations,  the  statistic  induced  by  U  is 
equivalent  to  the  following  extension  of  the  Kolmogorov-Smirnov  test  statistic: 

•  <m^  £  (t1  "  =  x  £  (^i(*i)  ~  (2>))* 

"  v  n;  nj  +  l  /  *»<-<*-» 
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7.  The  Multi-Sample  Location  Problem,  with  Unordered  Alternatives 

For  the  unordered  alternatives  case,  the  equivalence  class  [7r]  =  irSni  x  . . .  x  $nr  is  the 
same  as  in  the  preceding  section,  but  the  extremal  set  E  now  includes  all  of  the  extremal 
permutations  for  all  of  the  r!  different  possible  ordered  alternatives.  Thus  E  =  Sr  ©  (Sni  x 
...xSnJ,  where  the  dot  product  Sr©(Sni  x . . .  x  Snp)  equals  {/?  £  Sn:  3  a  t  Sr:  0{N<r( X) )  < 
•  •  •  < 

The  metrics  K ,  Ft  R,  H,  and  U  all  induce  very  natural  competitors  to  the  Kruskal- 
Wallis  (1952)  test  statistic.  Each  of  these  competitors  is  a  test  statistic  which  measures 
the  distance  to  the  closest  ordered  alternative.  The  test  statistic  is  thus  a  minimum  of  test 
statistics  of  the  type  presented  in  the  preceding  section,  with  the  minimization  being  done 
over  all  the  r!  ponible  ordered  alternatives.  In  the  statement  of  Theorem  4,  this  amounts 
to  minimizing  with  respect  to  a  permutation  a  e  Sr. 

Notation:  For  j  =  l,...,r  and  t  =  1, . . . ,  ny,  let  ay  <  ...  <  any;  be  the  ranks 

assigned  by  r  to  population  j. 


Theorem  4:  The  test  statistics  for  the  r-sample  location  problem  with  unordered 
alternatives,  induced  by  the  metrics  of  Section  3,  are: 

K(*Sni  x  . . .  x  Snr,  Sr  ©  5ni  x  . . .  x  5„r)  =  min  V  #{(*»  x  :  »(*)  >  *("»)}• 


}<k 


r  »*(/)  j-l 

F[^Sn j  x  ...  x  Snr ,  Sr  ©  Sni  x  ...  x  S*r)  —  oiin  ^  ^  l®»v(j)  (*  ^  ***(&))!• 

rj  =  l  is  1  k=l 


r  n*li) 


j-l 


R2{*Sni  X  ...  X  Snr,SrOSni  X  ...  X  Sn„)  =  mm  V  («,>(/)  -  (»  +  *<r(fe)))2 

l  i*l 

=  —— 1^t— -  -  a«v (/)(*'  +  M*))* 


;= 1  i=l 


*=1 


16 


X  ...  X  Snr  ,  Sr  ©  Sn,  X  ...  X  Snr) 

r  j  —  1  j 

-»-m«  51  n*(*)  <  *(0  ^  ]C  ”'(*)}• 

*  '  j*l  fc=l  fc=l 

U{irSnx  X  ...  X  5nri  5r  ©  5n,  X  ...  X  5n,) 

r  —  1 

=  n  -  {n»(r)  +  £(<v»t>)  “  W+i))>- 

»c5r  J*1 

Proof:  See  the  Appendix.  □ 

Remarks:  (1)  With  the  aid  of  modern  computing  facilities  and  suitable  minimization 
algorithms,  it  should  be  feasible  to  calculate  any  of  the  preceding  test  statistics,  even  for 
moderately  large  r. 

(2)  For  equal  sample  sizes  from  r  >  2  populations,  the  statistic  induced  by  U  is 
equivalent  to  the  following  extension  of  the  Kolmogorov-Smirnov  statistic: 

r— 1 

atSr  Jml 

8.  The  Two-Way  Layout,  with  Ordered  Alternatives 

Suppose  there  are  b  blocks,  r  populations,  and  r  items  in  each  block  (one  from  each 
population).  Let  Xu  be  the  observation  from  the  x'-th  block  and  the  j-th  population.  The 
two-way  layout  model  is 

Xij  =  m  +  +  Tj  +  etJ,  for  »  =  1, . . . ,  b  and  j  =  1, . . . ,  r, 

where  n  is  the  unknown  overall  mean,  -y ,•  is  the  x-th  block  effect,  r;  is  the  j'-th  population 

i  r 

effect,  the  are  i.i.d.  noise  variables,  and  £  H  =  0  =  J2  Ti- 

t=l  ;=1 

For  the  ordered  alternatives  case,  we  are  testing  Ho:  rx  =  ...  =  rr  versus  Hi :  tx  < 
. . .  <  rr,  where  at  least  one  of  the  inequalities  is  strict.  We  proceed  to  implement  Steps  1 
through  4  of  the  basic  construction. 


Step  1:  The  observed  permutation  n  is  the  ranking  of  all  the  n  =  rb  items.  We 
make  the  convention  that  item  (t  —  l)r + j  is  from  block  i  and  population  j,  for  t  =  1, . . . , 6 
and  j  =:  1, . . . ,  r.  As  usual,  tt((*  —  l)r  +  j)  is  the  rank  given  to  this  item. 

Step  2:  For  the  two-way  layout,  two  permutations  t,  a  e  Sn  are  equivalent  if 
and  only  if,  within  ?'.ch  block,  they  induce  the  same  permutation  of  the  r  populations. 
Formally,  the  permutation  e  Sr  within  the  *-th  block,  induced  by  ?r,  is  defined  by 

*iti)  =  #{*  =  ?r((»  —  l)r  +  k)  <  »((*  -  l)r  +  /)} 

for  *  =  1, . . . ,  6  and  j  =  1, . . . ,  r.  Similarly,  let  a,  £  Sr  be  the  permutation  within  the  t-th 
block  induced  by  a.  Then  ir,a  e  Sn  are  equivalent  if  and  only  if  (»ri, . . . ,  *>)  =  (ai , . . . ,  ar), 
and  the  equivalence  class  [jt]  is  {a  e  S„:  (s-j , . . . , irr)  =  (alt ... ,  ar)}. 

Step  3:  The  extremal  set  E  is  the  equivalence  class  containing  the  identity  per¬ 
mutation  e.  Thus  E  —  [e]  consists  of  ail  permutations  which,  within  each  block,  rank 
population  1  first,  population  2  second,  and  so  on. 

Step  4:  For  each  of  the  metrics  d  =  K,  F,  R2 ,  and  U ,  the  test  statbtic  induced  by 
d  turns  out  to  be  the  sum  of  the  within-block  distances;  that  is, 

<*([»!.[«])  ~  ]£  <*(»«.«)• 

i=i 

(There  is  some  abuse  of  notation  here:  d  on  the  left  hand  side  is  a  metric  on  S„,  whereas 
d  on  the  right  hand  side  is  the  corresponding  metric  on  Sr.  Also  e  on  the  left  hand  side  is 
the  identity  permutation  in  Sn,  whereas  e  on  the  right  is  the  identity  permutation  in  Sr. 
Finally,  we  are  referring  to  JR2  as  a  “metric,”  even  though  it  does  not  satisfy  the  triangle 
inequality.) 

The  induced  test  statistics  are  given  by  Theorem  5  below. 


Notation:  We  introduce  the  abbreviation  “LLIS”  for  the  phrase  “length  of  the 


longest  increasing  subsequence.” 


Theorem  5:  The  test  statistics  for  the  two-way  layout  with  ordered  alternatives, 
induced  by  the  metrics  K ,  F ,  R,  and  U  of  Section  3,  are: 


=  II  K{iri,e) 

1=1 

b 

=  £  {1,  -  -  - ,  »->2:  3  <  k  and  *,•(/)  >  *■»(&)}. 


i=i 

6 


b  r 


**([»].  i«i)  =  H  kco  -ji 

tal  t=l  ;=1 

=  E  *’(*.•.«)  =  EE 

«=1  t=l  ;=1 

=  Mr  +  l)(2r+l)  _2  j.sj.  ^ 

J=1  t=l 

r  b 

This  is  equivalent  to  the  Page  (1963)  test  statistic  J2  J  02  ndj)- 

J»1  »«sl 

b  b 

^(W.(eJ)  =  H  =  «  -  H  £.L/S{jr,(l),...,?r,(r)}. 


t=i 


»=  l 


Proof:  See  the  Appendix.  □ 


Remarks:  For  other  hypothesis  testing  situations,  the  test  statistics  induced  by 

R  —  in  particular,  the  new  two-sample  location  test  statistic  introduced  in  Section  4  — 
can  be  regarded  now  as  extensions  of  Page’s  test.  Similarly,  the  new  tests  in  the  present 
section  are  extensions  of  previous  tests. 

The  statistic  induced  by  the  metric  H  is  not  given  in  Theorem  5,  and  indeed,  has 
not  yet  been  determined.  In  future  sections  of  this  paper,  analogous  omissions  will  have  a 
similar  meaning. 


9.  The  Two-Way  Layout,  with  Unordered  Alternatives 

For  the  unordered  alternatives  case,  the  equivalence  class  [tt]  is  the  same,  but  the 
extremal  set  E  consists  of  all  permutations  which,  within  each  block,  rank  the  populations 
in  the  same  order.  Thus  E  is  the  union  of  r!  equivalence  classes,  one  for  each  possible 
ranking  a  e  ST  of  the  r  populations.  Mathematically, 

E  =  Sr  ©  [«]  =  {0  e  Sn:  3  a  e  Sr:  Pi  -  a  V  *  =  1,...,6}. 

The  metrics  K ,  F,  R,  and  U  induce  new  test  statistics,  which  are  extensions  of  previous 
tests,  and  also  are  natural  competitors  to  the  Friedman  (1937)  -  Kendall-Babington  Smith 
(1939)  test.  Just  as  for  the  multi-sample  location  problem,  these  statistics  measure  the 
distance  to  the  closest  ordered  alternative.  Since  there  are  r!  possible  ordered  alternatives, 
the  expressions  for  the  induced  test  statistics  entail  a  minimization  with  respect  to  a 
permutation  a  e  Sr. 

Theorem  6:  The  test  statistics  for  the  two-way  layout  with  unordered  alternatives, 
induced  by  the  metrics  K,  F ,  R,  and  U  of  Section  3,  are: 

b 

-K(M>  Sr  ©  [e])  =  min  #(*-,-,  a) 

<7CSr 

isl 

b 

=  micny2#{(j,k)  e{l, . . . ,  r}3:  o{j)  <  a{k)  and  *•,•(/)  >  *, •(*)}. 

<TCSr 

»=1 

6  b  r 

F{\*b  Sr  ©  [e])  =  min  V  F(jrt-,a)  =  min  V  Kt(j')  - 


b 

.R2([7r],Sr  ©  [c])  =  mm  Vie2 (7T,-, a) 

(Te5f  *  ^ 
t=l 

=  ™5n  H  II (^0')  - 

9t  -  t=Xj  =  l 

j=i  »=i 

6 

o  [«])  =  min  T*  U{iri,c) 

<?cSr  '  " 

»=l 

b 

=  n  —  max  LX/S{7r,<7(l),7r,cr(2), . . .  ,rrta(r)}. 

GtSr  *  " 
t=l 

Proof:  See  the  Appendix.  □ 

Remarks:  In  the  case  of  the  statistic  induced  by  J?,  the  permutation  otSr  in  the 
preceding  theorem  is  determined  by  the  rule: 

b  b 

o{j)  <  a{k)  whenever  ir^j)  <  ^  *»(*)• 

»=i  »'=i 

b  b 

(If  ^  7T,  (/)  =  *«(&)>  then  the  relative  order  of  o(j)  and  cr[k)  may  be  chosen  arbitrar- 

i=l  t=l 

ily.) 

For  the  other  metrics,  the  permutation  c  e  Sr  can  be  found  numerically. 

10.  The  One-Sample  Location  Problem 

For  the  one-sample  location  problem,  there  are  nj  observations  Xi, . . . ,  Xn, ,  which 
are  assumed  to  follow  the  model  X,-  =  m  +  The  “noise  terms”  c,  are  independent  random 
variables,  symmetric  about  0.  We  now  implement  the  four  steps  of  the  basic  construction, 
considering  initially  the  case  of  ordered  alternatives.  The  hypotheses  are  Hq\  h  >  0  versus 


Step  1:  The  relative  ordering  of  the  observations  Xi,...,Xn,  provides  no  infor¬ 
mation  about  the  location  parameter  (j. .  So  we  create  a  superpopulation  of  n  =  2 4-  1 
observations,  namely:  Xi, . . . ,  Xni ,  0,  —  Xi, . . . ,  —  Xni .  The  relative  ordering  of  these  n 
items  contains  all  of  the  information  relevant  to  a  rank  test  of  Hq  versus  Hi,  and  thus 
defines  the  desired  permutation  ir  e  Sn.  For  *  =  1,... ,ni,  we  make  the  conventions:  ob¬ 
servation  Xi  corresponds  to  item  t,  — X,-  corresponds  to  item  n  +  1  —  t,  and  0  corresponds 
to  item  (n  +  l)/2  =  rti  -I- 1.  As  usual,  the  items  are  ranked  from  the  least  to  the  greatest, 
and  7r(x)  is  the  rank  given  item  t. 

Step'2:  The  equivalence  class  [it]  consists  of  all  permutations  which  give  Xi, . . . ,  X„, 
the  same  set  of  ranks  in  the  superpopulation  as  tt  (and  similarly  for  —  X\, . . . ,  —Xni).  To 
obtain  a  group-theoretic  description  of  [tt],  let  Ni  =  {l, . . . ,  ni }  be  the  set  of  labels  for  the 
observations  Xi,...,  Xni ,  let  N2  =  {n^  +  l},  and  let  N3  =  { n 1  +  2, . . . ,  n}  be  the  set  of 
labels  for  the  observations  -Xni . . ,  -X\.  Then  [tt]  is  the  left  coset  it  Sni  x  Si  x  Sn, , 
where  Sn,  x  Si  x  Sni  is  the  subgroup  {a  c  Sn:  o{Nj)  =  Nj  Vj  =  1,2,3}. 

Step  3:  The  extremal  set  E  is  the  subgroup  Sn,  x  Si  x  S„, .  Thus  E  consists  of  all 
permutations  which  assign  the  first  ni  ranks  to  Xi,...,Xn,  (in  some  order),  the  middle 
rank  to  0,  and  the  last  ni  ranks  to  —  X\, . . . ,  —  Xn,  •  This  means  that  for  an  extremal 
permutation,  all  X,  <  0. 

Step  4:  To  describe  the  induced  test  statistics,  it  is  helpful  to  introduce  the  following 

Notation:  For  tt  e  Sn  and  for  i  =  l,...,rti,  define  7r*(t)  =  |jt(»)  —  (n  +  l)/2|. 

Then,  for  it  constructed  from  the  observations  Xi, . . . , Xn,  as  above,  it  follows  that  tt*  is 
a  permutation  of  {l, . . . ,  ni}.  In  fact,  tt*(*)  is  the  rank  of  |X» |  in  the  set  {|Xi|, . . . ,  |Xn,  |). 

Let  ai  <  ...  <  an,  be  an  enumeration  of  the  set  Tr(JV’i)  =  7r{l, . . . ,ni},  let  un,  +  i  = 
Tij  +  1  =  +  l)>  and  let  ani+2  <  ...  <  an  be  an  enumeration  of  the  set  = 

7r{ni  +  2,. . .  ,n}. 
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Let  F\  ( x )  be  the  empirical  cumulative  distribution  function  for  the  data  set  X\t. 

Xni ,  and  let  .^(x)  be  the  empirical  c.d.f.  for  the  data  set  —  X\, . . . ,  —  Xni . 

Theorem  7:  The  test  statistics  for  the  one-sample  location  problem  with  ordered 
alternatives,  induced  by  the  metrics  K,  F,  R,  H,  and  U  of  Section  3,  axe: 

K{’kSTi1  x  Si  x  Sni,Snx  x  Si  x  Sni)  =  2  ^  ^  ic  (x). 

Xi>0} 

This  is  equivalent  to  the  Wilcoxon  (1945)  signed  rank  test  statistic  =  Y  ?r*(t). 

{tcATi:  Xi>0) 

F(nSni  x  Si  x  Sni,Sni  x  Si  x  Sni)  =  4  n  (x‘). 

{icN j  :  X.  >0) 

This  is  also  equivalent  to  the  Wilcoxon  signed  rank  test  statistic. 

n 

-ft  (7r‘^ti  x  Si  x  SB1 ,S„,  x  Si  x  Sni)  =  fa,  —  i)2 


=  ?h.t  V&J.!).  _  2  y.  «'<*• 


This  is  equivalent  to  the  statistic  Y  ia«- 

«'=i 


H{*Snx  x  Si  x  SBl,Sni  x  Si  x  Sni)  =  2 #{:'  e  JVj:  «*(*)  > 

=  2#{t  e  AT i :  Xt- >  0}. 

This  is  equivalent  to  the  Fisher  (1925)  sign  test  statistic  =  #{x  e  AV.  X,-  >  0}. 

U(ttS ni  x  Si  x  Sni,Sni  x  Si  x  Sn,) 

ni  +  1  -  rii  max{Fi  (x)  -  FWx)}  if  maxl/'i  (x)  -  ^(x)}  <  1 

=  <  1  * 

0  if  max{Fi(x)  —  ^(x)}  =  1. 

w  s 

This  is  equivalent  to  the  test  statistic  max{Fi(x)  +  F\{— x)}  -  1  (assuming  no  ties).  Thus 
the  statistic  max{Fi(x)  +  A(“ 2)}  can  be  regarded  as  an  extension  of  the  Kolmogorov- 
Smirnov  statistic  to  the  one-sample  location  problem. 


23 


Proof:  See  the  Appendix.  □ 

For  the  case  of  unordered  alternatives,  the  hypotheses  axe  Ho-  m  =  0  versus  H\\  n  #  0. 
The  equivalence  class  [?r]  is  the  same  as  before,  but  the  extremal  set  E  should  include  the 
situation  where  all  X,  >  0,  as  well  as  the  case  where  all  X»  <  0.  Thus 
E  =  S2©(Sn,  x  Si  x  SnJ 

=  {/?  c  Sn:  3  a  bijection  a:  {1,3}  -♦  {1, 3): =  Nj  V  j  =s  1,3}. 

We  obtain  the  natural  two-sided  versions  of  the  previous  tests: 


Notation:  For  i  =  1, . . . ,  n,  define  o»  by 

ani  +  l+t  *  —  1, . . . ,  Til 

a*  =  Oi  =  Til  +  1  for  »  =  Til  +  1 

a,'-*,-!  fort  =  m  +  2,...,n. 

Theorem  8:  The  test  statistics  for  the  one-sample  location  problem  with  unordered 
alternatives,  induced  by  the  metrics  K,  F,  R,  JET,  and  U  of  Section  3,  are: 

X  Si  x  Sni,Sj  ©  Sn,  X  Si  x  Sni) 

_  2  {  ni(ni  +  *)  _ 


r  *•(••»)  -  }  • 

X.>0  J  ) 


This  is  equivalent  to  the  two-sided  Wilcoxon  signed  rank  test  statistic  =  |  {  £  tt*  (t)  J  — 

Vx->o  / 

ni(ni  +  l)/4|. 


X  Si  X  Sn,,S2  ©  Snj  X  Si  X  Sn,) 

ni(n:  -i- 1) 


=  4 


(E"„,).a¥a|). 


This  is  also  equivalent  to  the  two-sided  Wilcoxon  signed  rank  test  statistic. 
R?(it Sn j  x  Si  x  Sn, ,  S2  ©  Sni  x  Si  x  Sni ) 

=  min{^(a,-  -  -  t)2} 


«=i 


i=i 


=  n(’t+1fnt1)  -2  id,  £ 


\-l  1=1 
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This  is  equivalent  to  the  test  statistic  max{^2”=1  *«<}• 

x  Sj  x  Sni,52  ©  5ni  x  Si  x  Sni) 

=  l#{>'  £  JVi=  *>0}-£|}. 

This  is  equivalent  to  the  two-sided  Fisher  sign  test  statistic  =  |#{i  e  JVj:  Xi  >  0}  —  »i/2|. 
U(jrSn ,  x  Si  x  Sni,S2  ©  Sni  x  Si  x  Sni) 

{rii  +  1  -  nx  max{Fi(x)  -  .F2(x)}  if  max{F’i(x)  -  .F2(*)}  <  1 
0  if  max{Fi(x)  —  F2(x)}  =  1. 

This  is  equivalent  to  the  test  statistic  max  |A(z)  —  A(s)|  ==  max  lAfc)  +  A(—z)  —  1|. 
Proof:  See  the  Appendix.  □ 


11.  The  Two-Sample  Dispersion  Problem,  with  Equal  Medians 


We  consider  initially  the  problem  of  ordered  alternatives.  Suppose  the  data  come 
from  two  populations,  with  (population)  cumulative  distribution  functions  Fi(x)  and 
F2(x),  respectively,  each  having  the  same  median  (i.  Further  assume  that  F2(x)  = 
jFi  (/x  +  (x  —  n)/^)  for  some  “dispersion  parameter”  7  >  0.  The  hypotheses  to  be  tested 
axe  Ho :  7  >  1  versus  H  1:7  <  1.  Thus  the  alternative  hypothesis  H\  is  that  the  second 
population  is  “less  spread  out”  than  the  first. 


Throughout  this  section  we  will  assume  that  the  two  sample  sizes,  ni  and  n2,  are  both 
even  numbers.  This  assumption  is  made  merely  for  the  sake  of  notational  convenience,  and 
to  illustrate  the  basic  concepts.  Similar  results  can  be  obtained  when  ni  or  n2,  or  both, 
are  odd  numbers.  These  results  are  extensions  of  those  presented  below  (and  are  just  as 
intuitive) ,  but  it  would  be  slightly  messy  to  consider  all  of  the  different  cases  at  once. 


We  proceed  to  implement  the  four  steps  of  the  basic  construction. 


Step  1:  As  for  the  two-sample  location  problem,  let  items  1, ...  ,nj  be  from  popu¬ 
lation  1,  and  items  n2  +  1,... ,nj  +  n2  be  from  population  2.  Let  jVi  and  Ar2  be  the  sets 


Ni  =  {l,...,nx}  and  =  {nx  +  l,...,nx  +  n2},  and  let  n  =  nx  +  n2.  The  observed 
permutation  n  c  Sn  is  the  rank  ordering  of  all  the  n  items. 

Step  2:  For  this  two-sample  dispersion  problem,  a  permutation  a  e  Sn  is  “equivalent” 
to  the  observed  permutation  ir  if  and  only  if  a  can  be  obtained  from  7r  by  a  combination 
of  the  following  two  types  of  operations:  (l)  within  each  population,  permuting  the  labels 
assigned  to  the  items  in  that  population;  and  (2)  transposing  the  items  ranked  in  positions 
i  and  n  1-1— t,  for  any  i  =  1, ...  ,».  Operations  of  type  (1)  have  the  same  justification  as  was 
given  in  the  two-sample  location  problem,  namely,  the  equivalence  class  [vr]  should  depend 
only  on  the  sets  of  ranks  n(Ni)  and  7r(JV2)  assigned  to  the  two  populations.  Operations  of 
type  (2)  are  justified  by  the  assumption  that  the  two  populations  have  the  same  median. 

To  give  a  group-theoretic  description  to  the  equivalence  class  [tt],  let  be  the 

subgroup  of  S»  corresponding  to  operations  of  type  (l)  above: 

Sni  x  Sna  s  {a  e  Sn:  a{Nj)  »  N}-  V  j  »  1,2}, 

and  let  T  be  the  subgroup  of  Sn  generated  by  the  transpositions  described  above  as  oper¬ 
ations  of  type  (2): 

T  =  {r  tSn:  V  »  =  1, . . . ,  n  :  r(»)  =  t  or  r(»)  =  n  +  1  -  »}; 

The  equivalence  class  [tt]  consists  of  all  permutations  of  the  form  r  ir  a ,  as  r  ranges  through 
T  and  a  ranges  through  5n,  x  5na.  Hence  [?r]  is  a  so-called  “double  coset”  (Curtis  and 
Reiner  (1966)),  and  is  denoted  by  T  r  5ni  x  5na. 

Step  3:  The  extremal  set  E  consists  of  all  permutations  which  rank  the  items  from 
the  second  population  “in  the  middle,”  and  the  items  from  the  first  population  “at  the 
two  ends.”  Pictorially,  E  consists  of  all  rankings  of  the  form 

-_X  O^O 

»i/2  *»a  ni/2 
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where  X’s  and  O' s  denote  items  from  populations  1  and  2,  respectively,  and  where  the  left 
tail  of  X’s  and  the  right  tail  of  X’s  have  the  same  length.  Thus 

E  =  {0€  $„:  1 0(i)  -  =±i|  >  1 0(j)  -  ^1  V  i  e  Nt  V  j  .  N,} 

=  {0  t  S„:  |/J(JV,)  -  2±A|  >  | W,)  - 


Clearly,  E  is  the  equivalence  class  [0]  containing  the  extremal  permutation 

i  for  *  =  1,. . . , 

V>(*)  =  <  *  +  n2  f°r  *  =  ^  +  1»  •  •  • » ni 

i  -  for  *  =  ni  +  1, . . . ,  nj  +  n2. 

Thus  E  is  the  double  coset  T  t/>  5n,  x  Sn, . 


Step  4:  To  state  the  test  statistics,  we  introduce  some  further 


Notation:  Let  bit . . . ,  bni  be  an  enumeration  of  ir(Ni)  such  that 

and  let  bni+i, . . . ,  bn  be  an  enumeration  of  x(.W2)  such  that 

For  any  real  number  r,  let  int(r)  denote  the  greatest  integer  which  is  less  than  or  equal  to 


Theorem  9:  The  test  statistics  for  the  two-sample  dispersion  problem  with  equal 
medians  and  ordered  alternatives,  induced  by  the  metrics  K,  F,  R,  and  H  of  Section  3, 


K{TirSni  x  Sn3,2VSni  x  Sna)  =  £xwn{*(«),»+  1  -  *(»)}  -  "l(rai  t£2. 

»» 1 

H 1 

This  is  equivalent  to  the  Ansari-Bradley  (1960)  test  statistic  =  J2  min{x(*),n+ 1  — *(*)}. 


F(T*Sn.  x  Sn3,Ttl>Sni  xSnj)=  2K{TwSHt  x  Sn„T*Sni  x5nj). 


This  is  obviously  also  equivalent  to  the  Ansari-Bradley  statistic. 


RHT*S„,  x  Sn„T^S„,  x  S_)  =  £  -  |i>(  -  -  int  • 


This  is  equivalent  to  the  (new)  test  statistic  0nt  (*T^) 


»*x 


H{T*Sni  x  5n„r^Sni  x  5na)  =  2m  -  2#  {«  c  Nn  x(i)  <jor  »(.*)  >  »  +  1  -  y } 


For  equal  sample  sizes  (m  =  n2  —  n/2),  this  is  equivalent  to  the  “quartile  test  statistic” 
(Westenberg  (1948)) 


=  #{;  6  Nil  n(i)  <  ~  or  *(»)  >  Y  +  !}. 


Proof:  See  the  Appendix.  □ 

For  the  unordered  alternatives  problem,  we  still  have  [?r]  =  T  ir  Sn,  x  5n, ,  but  now 
E  —  T  S2  ©  V*  5n,  X  Sna 

s  {0  €  S„:  3  <r  £  S,:  1/5(0  -  2±i|  >  1 0U)  -  2±1|  v  <  £  N.m  v  j  <  «,„>} 
s  {0  I  S„:  3  a  <  5,:  |«JV,(I))  -  2±i|  >  |/5(W„(2))  -  ^±±|}. 


Notation:  Define  bi,...,bn  by 


f  for  i  —  1, . . . ,  nj 

(  for  i  =  “f  1, . .  • , n. 


Theorem  10:  The  test  statistics  for  the  two-sample  dispersion  problem  with  equal 
medians  and  unordered  alternatives,  induced  by  the  metrics  K,  F ,  R,  and  R  of  Section  3, 
are: 


K(TirSni  xSn,,T.S2©  xjjSn ,  xSna) 


mn2 

4 


min{?r(t),n  + 


-  m 


(  n  +  2N\ 
V  4  ) 


This  is  equivalent  to  the  two-sided  Ansari-Bradley  statistic  (which  is  the  absolute  value 
term  in  the  above  expression). 


F{T  ir  Sn,  x  Sna S%  ©  xj>  Sn ,  x  Sna)  —  2 K[T  t  Sni  x  Sna,T  S%  ©  ifr  Sni  x  <?n8)« 

This  is  obviously  also  equivalent  to  the  two-sided  Ansari-Bradley  statistic. 

R*(T  5T  Sni  x  SnatT  S2  ©  0  Sni  x  Sna) 

I 

—  int 


— {E(¥- 


bi- 

h- 


n  +  1 


2 

n  +  1 


(¥))■ 


This  is  equivalent  to  the  statistic 


-cc  (w  Or)  h  -  ^1)  -  s  (**  |5<  - 

which  is  the  two-sided  version  of  the  corresponding  statistic  introduced  in  Theorem  9. 
H[T  jt  SB,  x  Sn3,T  Sj  0  ^  5n,  x  Sni ) 

=  2jmin{nj  “  #{*'  «  *(*)  <  y  or  *(0  >  n  +  1  -  ^-». 

For  equal  sample  sizes,  this  equals  ni  —  2|  #  {te  JV^:  *(*)  <  n/4  or  jt(»)  >  3n./4  +  1} 
nj/2|,  which  is  equivalent  to  the  two-sided  quartile  test  statistic 


n 


=  I  #{*'<  "■(*■)  <  “  or  »(*)  >  y  +  1>  -  y  | 


Proof:  See  the  Appendix.  □ 

The  extension  to  the  r-sample  dispersion  problem  (r  >  2),  with  equal  medians  as¬ 
sumed,  and  with  either  ordered  or  unordered  alternatives,  is  relatively  straightforward. 
These  test  statistics  will  appear  in  a  future  paper,  along  with  investigations  of  their  sta¬ 
tistical  properties. 
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12.  Testing  for  Trend 

We  conclude  with  a  very  trivial  application  of  the  basic  construction,  to  the  problem 
of  testing  for  trend.  Suppose  item  :  is  from  population  t  with  cumulative  distribution 
function  Fi.  We  are  testing  Hq:  Fi(x)  =  ...  =  Fn(x)  versus  Hi :  Fi(x)  >  ...  >  Fn[x) 
(where  each  inequality  is  strict  for  some  x).  Clearly  the  equivalence  class  [x]  consists  only 
of  the  observed  ranking  r  itself,  the  extremal  set  E  is  just  the  identity  permutation  t,  and 
the  test,  statistic  induced  by  the  metric  d  is  d(ir,e).  This  simple  case  has  been  mentioned 
in  order  to  show  that  the  procedure  “works"  here,  and  also  to  show  that  measuring  the 
distance  from  Hi  produces  a  very  natural  test  statistic. 

13.  Some  Connections  with  Previous  Work 

Many  important  questions  are  concerned  with  relating  this  novel  method  to  the  exists 
mg  nonparametric  theory.  We  will  consider  four  such  potential  areas  of  interconnection: 
(1)  the  theory  of  maximal  invariants,  (2)  the  theory  of  locally  most  powerful  rank  tests 
and  linear  rank  tests,  (3)  the  so-called  “inverse  problem,"  and  (4)  partially  ranked  data. 
To  keep  the  discussion  manageable,  we  will  consider  each  of  these  topics  in  relation  to  the 
two-sample  location  problem  (and  will  therefore  use  the  notation  of  Sections  4  and  5). 

13.1  Maximal  Invariants.  The  theory  of  maximal  invariants  (as  presented,  for 
example,  in  Lehmann  (1959))  may,  in  some  cases,  yield  a  more  precise  formulation  of  the 
set  [x]  described  in  Step  2  of  the  basic  construction.  Indeed,  for  the  two-sample  location 
problem,  the  maximal  invariants  are  the  set  . .  ,ni}  of  ranks  assigned  to  the  first 
population.  On  the  other  hand,  the  equivalence  class  [tt]  consists  of  all  permutations  a 
such  that  a{l,...,ni}  =  jr{l,...,»i}.  Thus,  for  the  two-sample  location  problem,  the 
equivalence  class  [tt]  consists  of  all  permutations  which  produce  the  same  values  of  the 
maximal  invariants  as  ir. 

13.2  Locally  Most  Powerful  Rank  Tests,  and  Linear  Rank  Tests.  Another 
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natural  area  of  inquiry  is  the  relation  with  the  theory  of  locally  most  powerful  rank  tests,  as 
presented,  for  example,  in  Hajek  and  Sidak  (1967).  In  this  regard,  consider  the  two-sample 
location  problem  with  ordered  alternatives,  for  which  the  hypotheses  are  Ho :  ^(x)  =  /(x) 
versus  Hi’.  g(x)  =  f(x  —  A)  for  some  A  >  0.  Here  f(x)  and  g(x )  are  the  underlying  den¬ 
sities  for  the  first  and  second  populations,  respectively;  they  are  assumed  to  be  absolutely 

0° 

continuous  and  to  satisfy  /  ) /'(x)j  dx  <  oo. 

~oo 

Let.  ir  e  Sn  be  the  observed  ranking  of  all  n  =  nt  +  n2  items,  as  discussed  in  Section  4. 
Then  for  A  >  0  sufficiently  small,  the  locally  most  powerful  rank  test  (Hajek  and  Sidak, 
pages  67-68)  rejects  Ho  for  small  valuos  of  the  test  statistic 

v-  /  r  (*<'“»)! 

k  1  fw™)  r 

where  X^  <  ...  <  X ^  are  the  order  statistics  for  a  random  sample  of  size  n  from  the 
density  /. 

It  is  natural  to  ask  whether  this  locally  most  powerful  test  statistic  is,  in  fact,  induced 
by  our  basic  construction,  by  some  metric  on  permutations.  More  generally,  the  locally 
most  powerful  rank  test  statistics  are  a  subclass  of  the  so-called  “linear  rank  test  statistics" 
of  the  form  M^O)*  aJid  we  could  ask  whether  any  such  linear  statistic  is  induced  by 

»=»i 

a  metric. 


The  answer  to  both  questions  turns  out  to  be  yes. 

Notation:  Define  the  function  hj :  { 1, . . . ,  n}  — *  R  by 

Mfl.  for  Ja  1, ....  n. 

Hijek  and  Sidak  called  these  quantities  the  “scores"  generated  by  the  density  /;  we  assume 
henceforth  that  the  hf(j)  are  strictly  increasing  in  j.  Define  the  metric  dj  on  Sn  by 

df[n,a)  =  IM*(0)  "  hf(°i 0)1- 

i=i 
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Theorem  11:  For  the  two-sample  location  problem  with  ordered  alternatives,  the 
test  statistic  induced  by  the  metric  df  is 

*1  nj 

d/(fl'Snl  x  Sn, , Sni  x  5„a)  =  2^^h/(7r(t))  —  2  h/(t), 

»=i  »=i 

and  thus  is  equivalent  to*  the  locally  most  powerful  rank  test  statistic  ^  hj (■*({))  for 

»»i 

testing  the  hypotheses  Hq :  g(x)  =  f(x)  versus  Hi:  g(x)  =  f(x  —  A). 

Proof:  Entirely  analogous  to  the  proof  of  Theorem  1  for  the  Spearman’s  footrule 
metric  F.  In  that  proof  (see  the  Appendix),  substitute  hf(ir(t))  for  7r(t)  everywhere,  and 
h/(i)  for  t.  □ 

Remarks:  (1)  The  metrics  df  which  yield  the  locally  most  powerful  rank  tests  are 
a  subclass  of  the  so-called  “fixed  vector  metrics,"  studied,  in  other  statistical  contexts,  by 
Rukhin  (1972),  Diaconis  (1982),  and  Critchlow  (1985,  pages  27-30). 

(2)  The  same  argument  shows  that  an  arbitrary  linear  rank  statistic  ^  MMO)  i* 

i>i 

induced  by  the  metric  d[n,cr)  =  ^  |h(?r(*))  —  h(cr(»))|. 

»*i 

13.3.  The  “Inverse  Problem” .  Given  any  locally  most  powerful  rank  test  statistic 
for  the  two-sample  location  problem,  there  exists  a  metric  on  permutations  which  induces 
it,  and  that  metric  was  presented  in  Section  13.2.  The  natural  generalization  of  this  line  of 
inquiry  is  the  so-called  “inverse  problem” ;  namely,  given  an  arbitrary  rank  test  statistic,  we 
can  ask  whether  it  is  induced  by  some  metric.  Theorem  11  of  Section  13.2  solves  the  inverse 
problem  for  the  locally  most  powerful  statistics  for  the  two-sample  location  problem,  and 
more  generally  for  all  linear  rank  test  statistics.  However,  for  the  two-sample  location 
problem,  there  are  important  test  statistics  which  are  not  locally  most  powerful  against 
any  alternative,  and  which  are  not  linear,  such  as  the  Kolmogorov-Smirnov  test,  the  Wald- 
Wolfowitz  (1940)  “runs  test”,  the  so-called  “tests  based  on  exceeding  observations”  (Hajek 
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and  Sidak,  pages  89-90),  and  Lehmann's  (1951)  “quadruples  test".  We  now  discuss  the 
inverse  problem  for  these  tests. 

Of  course,  a  principal  reason  for  posing  the  inverse  problem  for  any  test  statistic  is 
that  if  a  corresponding  metric  can  be  found,  then  that  metric  can  be  used  to  extend  the 
given  test  statistic  to  all  other  hypothesis  testing  problems. 

By  Theorem  1  of  Section  4,  Ulam’s  metric  U  induces  the  Kolmogorov-Smirnov  test 
statistic,  for  equal  sample  sizes.  The  following  result  shows  that  the  Wald-Wolfowitz  test 
statistic  is  also  induced  by  a  metric. 

Definition:  Let  it  t  Sn  be  the  observed  ranking  of  the  nj  items  from  population  1 
and  the  n 2  items  from  population  2,  where  tl\  +  n.2  =  n.  Define  the  number  of  “runs”  in 
it  to  be 


#{*  =  1,.. v  l(*(i)  +  1)  >  rii)  +#{» =  n1  +  1, ... , ni  +  n^:  ir~l  {tt(*)  +  1)  <  m), 

with  the  convention  that  if  tt(i)  =  n,  then  i  contributes  1  to  the  number  of  runs.  Thus,  if 
X’s  and  O’s  denote  the  items  from  populations  1  and  2,  respectively,  then 

#  runs  =  #(AT’s  followed  by  an  O)  +  #(0's  followed  by  an  X)  +  1. 

The  Wald-Wolfowitz  “runs  test”  statistic  is  precisely  the  number  of  runs  in  ?r. 

Theorem  12:  For  ?r,  a  e  Sn ,  define 

W(tt,ct)  =  #{:  =  l,...,n  -  1:  7ro_1(»  +  1)  ^  7r<7-1(t)  +  l}. 

Then  W  is  a  metric  on  5„.  Moreover,  for  the  two-sample  location  problem  with  unordered 
alternatives,  the  test  statistic  induced  by  W  is 

W(irSni  x  Sn,,S3  ©  Sn,  x5nJ  =  #(runs  in  it)  -  2, 
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which  is  equivalent  to  the  Wald-Wolfowitz  “runs  test”  statistic. 


Proof:  See  the  Appendix.  □ 

Remarks:  (1)  We  could  call  W(ir,<r)  the  “successor  metric”  because  it  counts  the 
number  of  items,  among  those  given  the  top  n  —  1  ranks  by  a,  which  have  a  different 
successor  under  the  ranking  ir  than  they  do  under  cr. 

(2)  The  metric  W{k,o)  can  now  be  used,  in  the  context  of  our  basic  construction,  to 
find  “natural  extensions”  of  the  Wald-Wolfowitz  runs  test  to  all  of  the  other  hypothesis 
testing  situations  considered  in  this  paper.  These  extensions  will  be  derived  and  discussed 
in  a  future  paper. 

(3)  It  also  turns  out  that  all  of  the  “tests  based  on  exceeding  observations”  (Hijek 
and  Sidik,  pages  89-90)  are  induced  by  suitable  metrics,  as  is  Lehmann’s  (1951)  one-sided 
“quadruples  test”  statistic 

=  #{(*,J.M)  «  Ni  x  Ni  x  N2  x  N2:  »(•)  <  x(j)  <  x(k)  <  tt(/)>. 

Therefore,  these  tests  also  have  natural  extensions  to  other  testing  situations.  A  subsequent 
paper  will  introduce  the  corresponding  metrics  and  investigate  the  extensions. 

(4)  The  author  is  not  aware  of  any  procedure  which  solves  the  inverse  problem  in 
general.  Rather,  solutions  to  the  inverse  problem  have  thus  far  been  obtained  only  on  a 
case  by  case  basis.  In  the  case  of  the  Wald-Wolfowitz  test,  for  example,  the  author  just 
happened  to  observe  that  the  metric  W(ir,o)  “worked". 

(5)  In  Section  13.2,  it  was  shown  that  all  locally  most  powerful  statistics,  and  indeed 
all  linear  statistics,  are  induced  by  metrics.  Moreover,  the  present  section  demonstrates 
that  the  Kolmogorov-Smimov  and  Wald-Wolfowitz  statistics  are  also  induced  by  metrics, 
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even  though  they  are  not  linear.  Hence,  our  basic  construction  actually  gives  rise  to  a 
larger  class  of  test  statistics,  for  the  two-sample  location  problem. 

13.4  Partially  Ranked  Data,  and  the  Hausdorff  Metric. 

A  full  ranking  of  n  items  is  simply  an  ordering  of  all  these  items,  of  the  form:  first 
choice,  second  choice,  . . .,  n-th  choice.  If  two  judges  each  rank  the  same  n  items,  statis¬ 
ticians  and  other  scientists  have  used  the  various  metrics  of  Section  3  to  measure  the 
closeness  of  the  two  rankings. 

A  partial  ranking  arises  when  each  judge  specifies  only  his  first  k  choices,  where  k  < 
n.  The  author’s  recent  monograph  (Critchlow  (1985))  is  concerned  with  extending  the 
metrics  of  Section  3  to  metrics  between  partial  rankings,  and  with  exploring  the  many 
data-analytic  applications  of  such  metrics.  More  complex  types  of  partially  ranked  data 
are  also  investigated. 

What  is  the  relationship  of  that  material  with  the  present  paper  on  rank  tests?  It 
is  perhaps  surprising  that  any  connection  exists  at  all,  since  the  two  topics  are  ostensibly 
quite  distinct.  However,  as  shown  in  the  monograph,  a  partial  ranking  of  n  items  can  be 
identified  with  a  right  coset 

Sir  —  {ere  Sn :  3  o  t  S:  a  =  on} 

of  a  certain  subgroup  5  of  Sn.  The  idea  here  is  to  identify  the  partial  ranking  with  the 
set  of  full  rankings  that  are  consistent  with  it,  and  that  set  turns  out  to  be  a  right  coset 
of  the  above  form  (Critchlow  (1985),  pages  12-14). 

Distances  between  partial  rankings  can  therefore  be  identified  with  distances  between 
right  cosets  Stt,  So.  To  obtain  such  a  distance,  let  d  be  a  metric  on  Sn,  and  consider  the 
quantity 

d(Sn,So)  =  max  <  max  min  d( a,/?),  max  min  d(a,/3) 

({3cStr  atSw  atSr  0cSff 
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This  is  called  the  “Hausdorff  distance”  between  the  cosets  Sit  and  So,  induced  by  d.  It 
satisfies  all  of  the  axioms  for  a  metric  on  the  space  of  such  cosets  (e.g.  Nadler  (1978)).  The 
author’s  monograph  computes  the  Hausdorff  distances  between  such  right  cosets,  induced 
by  the  metrics  K ,  F ,  R,  H,  and  U. 


For  nonparametric  rank  tests,  the  sets  of  permutations  in  Steps  2  and  3  of  the  basic 
construction  are  frequently  left  cosets  itS  of  a  subgroup  S  of  Sn.  As  illustrated  by  the 
two-sample  location  problem  with  ordered  alternatives,  many  useful  test  statistics  can  be 
obtained  by  computing  the  minimum  interpoint  distance  between  two  such  left  cosets. 

Moreover,  it  can  be  shown  that  the  minimum  interpoint  distance  between  two  left 
cosets  actually  coincides  with  the  Hausdorff  distance  between  the  left  cosets: 


Proposition:  Let  G  be  any  finite  group,  S  any  subgroup  of  G,  and  d  any  right- 
invariant  metric  on  G.  Then  the  Hausdorff  distance  between  any  two  left  cosets  itS  and 
oSs 

max  <  max  min  d(a,0),  max  min  d(a,/J) 
aei rS  atrS  fittrS 

is  equal  to  the  minimum  interpoint  distance 


mm  d(a,P). 

airS 

0t<rS 


Proof:  This  is  the  Lemma  on  pages  21-24  of  Critchlow  (1985),  with  the  roles  of 

“left”  and  “right”  reversed.  □ 

In  general,  Hausdorff  distances  between  left  cosets  are  not  the  same  as  Hausdorff 


distances  between  right  cosets.  That  is,  the  rank  test  statistics  of  Theorem  1  are  distinct 
from  the  metrics  on  partial  rankings  in  the  author’s  Monograph.  However,  they  can  be 
derived  by  analogous  arguments. 


In  sum,  two  different  areas  of  statistical  interest  —  nonparametric  rank  tests  and 
partially  ranked  data  —  can  be  approached  by  quite  similar  analytical  tools. 

14.  Conclusion 

The  construction  of  nonparametric  rank  tests,  and  the  extension  of  such  tests  to  other 
hypothesis  testing  situations,  axe  important  statistical  problems.  Various  techniques  have 
been  suggested  in  the  statistical  literature,  for  generalizing  rank  tests  to  other  testing 
situations.  For  example,  Hajek  and  Sidak  (1967)  discuss  the  literature  on  extending  two- 
sample  location  tests  to  the  multi-sample  problem,  and  also  describe  the  Siegel-Tukey 
(1960)  method  of  converting  tests  of  location  into  tests  of  dispersion. 

The  methodology  presented  in  this  paper  simultaneously  extends  an  arbitrary  rank 
test  to  many  other  hypothesis  testing  situations.  Since  the  approach  gives  rise  to  so  many 
useful  test  statistics,  both  old  and  new,  its  properties  deserve  further  investigation. 

*  *  * 

APPENDIX:  DERIVATIONS  OF  THE  INDUCED  TEST  STATISTICS 

In  this  appendix  we  will  derive  the  test  statistics  induced  by  the  metrics  K,  F,  R, 
H,  and  U  for  the  two-sample  and  multi-sample  location  problems,  the  two-way  layout, 
the  one-sample  location  problem,  and  the  two-sample  dispersion  problem.  Each  testing 
problem  will  be  considered  for  both  ordered  and  unordered  alternatives.  In  other  words, 
we  will  prove  Theorems  1  through  10. 

Moreover,  we  will  prove  Theorem  12,  which  states  that  the  metric  W[n,c),  described 
in  Section  13.3,  induces  the  Wald-Wolfowitz  “runs  test”  statistic. 

Before  proceeding  to  the  details  of  the  individual  proofs,  it  will  be  helpful  to  discuss  a 
pair  of  general  properties  possessed  by  many  metrics  on  permutations:  the  right-invariance 


property  (originally  noticed  by  Diaconis  (1982))  and  the  “transposition  property”.  These 
properties  will  be  used  extensively  in  the  derivations.  Two  additional  general  properties, 
the  “first  and  second  partition  properties” ,  will  be  introduced  in  the  proof  of  Theorems  5 
and  6,  and  will  be  instrumental  for  Theorems  9  and  10  as  well. 

1.  The  Right-Invariance  Property 

Definition:  The  metric  d  on  Sn  is  right-invariant  if  d(?r,  a)  —  d(nr,  or)  V  r,  a,  r  e  Sn. 

Proposition  A.l:  All  of  the  metrics  on  permutations  presented  in  this  paper  are 
right- invariant.  In  particular,  K,  F,  R,  H,  and  U  are  right-invariant. 

Proof:  The  proof  is  very  easy  and  is  omitted.  □ 

Remarks:  Proposition  A.l  is  not  a  coincidence,  as  it  is  natural  to  require  that  all  of 
our  metrics  be  right-invariant.  Indeed,  in  Section  3,  to  set  up  the  correspondence  between 
rankings  and  permutations,  a  preliminary  step  was  to  assign  arbitrary  numerical  labels  to 
the  n  items:  item  1, . . . ,  item  n.  It  is  natural  to  insist  that  our  distances  between  rankings 
should  not  depend  on  this  arbitrary  labeling  of  the  items,  and  should  be  invariant  under  an 
arbitrary  relabeling.  Mathematically,  this  amounts  to  requiring  that  the  metrics  be  right- 
invariant.  See  Diaconis  (1982)  or  Critchlow  (1985,  pages  10-11)  for  further  discussion. 

The  right-invariance  property  has  the  following  important  consequence.  Many  of  the 
test  statistics  in  this  paper  are  obtained  by  computing  the  minimum  interpoint  distance 
between  a  certain  subgroup  5  of  5n,  and  a  left  coset  irS  of  that  subgroup.  The  right- 
invariance  property  implies  that  this  distance  is  the  same  as  the  minimum  interpoint 
distance  between  nS  and  the  identity  permutation  e: 

Proposition  A.2:  Let  d  be  a  right-invariant  metric  on  Sn,  let  5  be  an  arbitrary 
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subgroup  of  Sn,  and  let  jrS.be  a  left  coset  of  S  (where  jt  e  S„).  Then 


min  d(a,/3)  =  min  d(a,  e). 

acrS  atwS 

fitS 


Proof: 

min  d(at0)  =  min  d(irai,o2)  =  min  dfnoioZ1  ,e) 
atrS  <Ti,<T3  tS  »ii»a  tS 

fitS 

=  min  d{iro,t)  =  min  d(a,  e)£] 

<rtS  '  '  acrS  V  ' 


2.  The  Transposition  Property 


A  second  property  of  many  metrics  on  permutations,  which  will  be  useful  for  deriving 
several  rank  test  statistics,  is  the  so-called  “transposition  property" . 

Definition:  Let  d  be  a  metric  on  Sn.  Let  ct,/3,7  eSn  be  permutations  such  that  a 
and  0  differ  by  a  single  transposition;  that  is,  there  exist  integers  p,  q  e{l, . . . ,  n}  such  that 

*(p)  =  /*(?) 

“(?)  =  0[p) 

<*(0=0(0  VtVp.q. 

Suppose  further  that  0{p)  <  0{q)  and  7 (p)  <  7(g).  If  the  preceding  conditions  imply  that 
d{0,~i)  <  d(a,~i),  then  the  metric  d  is  said  to  possess  the  “transposition  property”. 

Proposition  A. 3:  The  metrics  K,  F,  and  R  possess  the  transposition  property. 


Proof:  This  is  Lemma  2,  pages  50-53  of  Critchlow  (1985).  □ 

Remarks:  Critchlow  (1985)  also  gives  counterexamples  which  show  that  the  metrics 
H  and  U  do  not  have  the  transposition  property. 


In  the  preceding  reference,  the  transposition  property  is  helpful  for  a  different  statis¬ 
tical  purpose ,  that  of  deriving  suitable  metrics  on  partially  ranked  data.  Essentially  the 


same  property  has  also  been  investigated,  in  yet  another  statistical  context,  by  Hollander, 
Proschan,  and  Sethuraman  (1977). 


* 


* 


* 


We  proceed  now  to  the  proofs  of  Theorems  1  through  10,  and  of  Theorem  12. 

Proof  of  Theorems  1  and  3:  We  will  prove  Theorem  3  (since  Theorem  1  then 
follows  as  a  special  case). 

For  the  multi-sample  location  problem  with  ordered  alternatives,  the  test  statistics 

(in  Theorem  3)  are  of  the  form  d(7rSni  x  ...  x  Sn,,Sni  x  ...  x  SnJ,  where  d  is  one  of 

the  metrics  K,  F,  R,  H,  U.  Since  all  of  these  metrics  axe  right-invariant,  Proposition  A.2 

implies  that  the  test  statistic  induced  by  d  is  equal  to  min  d(a,  e). 

{ktwS^i  x...x5^r 

Let  oto  e  jt 5ni  x . . .  x  Sn,  be  a  permutation  which  actually  attains  the  above  minimum; 
that  is,  oto  satisfies  d(ao,c)  =  min  d(a,  e).  Obviously  ao  depends  on  d.  The 

X.««X5||r 

rest  of  the  proof  splits  into  three  cases,  according  to  the  metric  d  under  consideration:  (i) 
d  =  K,F,  or  R ,  (ii)  d  =  U,  and  (iii)  d  —  H. 


(i)  If  d  =  K,  F,  or  R,  then  we  can  construct  ao  as  follows.  Recall  that  an,  +...+n,_,  +i  < 
an,  +2  <  ...  <  a», +...+!»„_»+»,  axe  the  ranks  assigned  by  i r  to  population  j,  for 

j  =  1, . . . ,  r.  Define  ao  €  Sn  by  ao(i)  =  a,-,  for  *  =  1, ... ,  n.  Clearly  ao  £  ir Sni  x  . . .  x  Snr, 
since  ao  assigns  the  same  set  of  ranks  to  population  j  as  does  n.  Moreover,  it  is  easy  to 
see  that  ao  satisfies  d(ao,e)  =  min  <f(a,  e),  because  each  of  the  metrics  K ,  F, 

X...X5nr 

and  R  possesses  the  transposition  property.  Indeed,  given  any  a  £  ir 5n,  x  ...  x  5n,,  we 
can  obtain  ao  from  a  by  a  sequence  of  transpositions,  each  of  which  satisfies  the  condition 
stated  in  the  definition  of  the  transposition  property.  (The  first  transposition  transposes 
item  1  with  item  Oq  ^aj),  and  so  on;  the  t-th  transposition  transposes  item  »  with  the 
item  currently  assigned  the  rank  a,.) 
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Thus  if  d  =  K,  F,  or  R,  the  test  statistic  induced  by  d  is  d(ao,e),  with  ao  constructed 
as  above.  The  results  of  Theorem  3  follow  immediately,  for  these  three  metrics. 

(ii)  If  d  =  U,  we  claim  that  the  same  permutation  ao  as  constructed  in  (i)  satisfies 

U(a<},e)  —  min  CT(a,  e).  Since  the  metric  U  does  not  possess  the  transposition 
X...xS„ 

property,  this  claim  must  be  proved  by  a  different  argument  than  in  (i). 

Lemma  A.l:  Let  a  e  r 5n,  x  . . .  x  Snr  be  arbitrary,  and  suppose  that  the  ranks 
t\,*3  t  {l,...,n}  satisfy  t'i  <  i2  and  a-1(t'i)  <  a-1^).  Then  ag  1(*i)  <  ag  l(t2). 

Proof  of  Lemma  A.l:  Recall  that  a“l(t)  is  the  item  assigned  the  rank  i,  and  that 
items  iti  +  ...  x  n^i  +  1,  ....nj  +  ...  -j-  n;-  are  from  population  j.  Now  if  items  a-1(t'i) 
and  a~1(i2)  are  from  the  same  population  j,  then  so  are  items  Oq  and  a^l(i2)  (since 
all  permutations  in  the  equivalence  class  irSni  x  . . .  x  Sn,  assign  the  same  set  of  ranks  to 
population  j).  It  follows  that  <  ag  1  (*3),  because  ctg  has  been  constructed  in  such 

a  way  that  within  each  population,  the  items  are  ranked  in  ascending  order. 

On  the  other  hand,  if  items  a”l(»i)  and  a-1  (t'2)  are  from  different  populations  j\ ,  j2 , 
then  <  a"*1  (t2)  implies  j\  <  j2,  whence  ag '(t'i)  <  ag  x(i2).  The  lemma  is  proved. 

□ 

From  the  lemma  it  follows  by  induction  that  for  any  a  e  n 5n,  x  ...  x  5n„,  if  t'i  < 
...  <  t*  satisfy  a-1^)  <  ...  <  a ”l(s*),  then  ag  x(*i)  <  ...  <  ag  l(ik).  Hence  LLIS 
{a-1(l),...,a“x(n)}  <  LL/5{ag  l(l), . . . , ag  l(n)}.  (Recall  “LLIS”  stands  for  “length 
of  the  longest  increasing  subsequence.”)  Therefore,  U(ao,e)  =  min  U(a,c),  as 

QiCTiSiij  X««*  X 

claimed. 


It  follows  that  the  test  statistic  induced  by  U  is  U(ao,e),  with  ao  constructed  as  in 
(i).  To  evaluate  U{ ao,e),  we  notice  that  picking  an  increasing  subsequence  of  {ag  1  (l), . . . , 
.a^x(n)}  amounts  to  choosing  some  integers  i\  <  i2  <  . . .  <  ir-i,  and  then  choosing  those 


41 


items  from  the  first  population  among  the  first  *1  ranks,  those  items  from  the  second 


population  among  the  next  t’2  —  t‘i  ranks,  and  so  on.  In  this  subsequence,  there  will  be  £,,  1 
items  from  population  1,  £t-a  2  —  £*,2  items  from  population  2,  and  so  on,  where  £,y  is  the 
number  of  items  from  the  j'-th  population  which  fall  among  the  top  i  ranks.  Therefore 
U{ao,  e)  «  n  —  LLIS{c^x[l),. . . ,  0*  l(n)> 


n  —  max 

il 


r— 1 


*»il  +  (  y! fo.j  —  *»,•_! i)  j  +  (nr  ~  Ur~xr) 
2 


I  fr-1  ' 

=  (n  -  nr)  -  ,i<;m«  ^  ^  £(*t,y  -  £tj.y+l)  -  . 

This  completes  the  proof  of  Theorem  3  for  Ulam’s  metric.  As  observed  in  Sections  4  and 
6,  for  equal  sample  sizes  from  r  =  2  populations,  the  statistic  induced  by  U  is  equivalent 
to  the  Kolmogorov-Smimov  statistic.  When  r  >  2,  the  statistic  induced  by  U  is  a  natural 
extension  of  the  Kolmogorov-Smirnov  statistic. 


(in)  For  the  metric  <£  =s  IT,  we  construct  the  permutation  ao  as  follows.  For  j  =  1, . . . ,  r 
let  my  =s  #(ir(.Ny)  n  IVy),  where  we  recall  that  IVy  =  {m  + . . .  +  ny_x  + 1, . . . ,  n*  + . . .  +  ny } 
is  the  set  of  labels  for  the  items  in  population  j.  Let: 


cyj , . . .  ,c;my  be  an  enumeration  of  n (Nj)  n  Nj , 

ejm.+1 , . . . , c;ny  be  an  enumeration  of  n(N:)c  n  Nj, 
c'my+i , . . . , cyn>.  be  an  enumeration  of  n(Nj)  n  Nj. 


Define  oto  <  Sn  by 


for  k  =  1, . . . ,  my 
for  k  =  rrij  +  1, . . . ,  ny . 


Then  a 0  «  xSn,  x  ...Snr,  and  ff(a0,c)  =  n  -  ^  my. 

;  =  i 

r 

®  €  xSn,  x  ...  x  Sn, 1  =  n  y!  ®  iVy:  o(t) 

;=i 

r 

H(ao,  t)  =  min  f/{a,  e)  =  n  -  £  my. 


atrSn,  X...XS. 


;=i 


On  the  other  hand,  for  any 

r 

=  *}  >  n  —  ^  m;.  Thus 
j=i 
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This  completes  the  proof  of  Theorem  3.  □ 

Proof  of  Theorems  2  and  4:  We  will  prove  Theorem  4,  since  Theorem  2  then 
follows  as  a  special  case. 

The  test  statistics  in  Theorem  4  are  the  minimum  interpoint  distances  between  the  sets 
\ir\  =  x5n,  x  ...  x  5n,  and  E  =  Sr  ©  5nt  x  . . .  x  Sn,  =  {/?  e  Sn:  3  at  Sr :  i))  <  . . .  < 

P[N0fr))}.  These  statistics  are  therefore  equal  to  min  min  d(a,/3),  where,  for 

&cSr  X««»X5||r 

fit er5%l  X...x5v 

each  oeSr,  e*  is  some  extremal  permutation  for  the  ordered  alternative  Hlt„:  Fa(i)(z)  > 
•  •  •  >  F9{r)(x),  and  where  E„  =  x  . ..  x  Sn„  =  {r  e  Sn:  r(A^<r(1))  <  . . .  <  r(W<T(r))} 
is  the  set  of  all  extremal  permutations  for 

We  tackle  now  the  problem  of  calculating  min  d(a,/?),  for  fixed  a  e  Sr. 

x...x5*f 

fit€wSni  X...XS*r 

Intuitively,  this  should  not  be  too  much  extra  work,  since  we  have  already  calculated  the 

test  statistics  for  the  particular  ordered  alternative  Fi(z)  >  . . .  >  Fr(x)  in  Theorem  3.  In 

fact,  we  will  demonstrate  formally  that  the  calculation  of  min  d(a,0)  follows 

aeir  S.,x...x5,r 
X...X5af 

directly  from  Theorem  3. 

Definitions:  For  fixed  a  e  Sr,  define  the  partition  JVf , . . . ,  N ?  of  {1, ,  n}  by  NJ  = 
e*(N <?(:))•  Let  s<r  =  Sn<rll)  x...x  5n,(r)  be  the  subgroup  of  Sn  given  by  {r  e  Sn:  t(N J)  = 
Nj  V  j  =  1, . . . ,  r).  Let  S  denote  the  subgroup  Sn,  x  . . .  x  Snr  considered  previously. 

Lemma  A. 2:  The  left  coset  t„S  equals  the  right  coset  S0c0. 

Proof  of  Lemma  A. 2:  Both  cosets  are  clearly  {r  c  Sn :  r(Nj)  =  V  j  =  1, . . .  ,r}. 

□ 

Lemma  A.3:  The  minimum  interpoint  distance  between  the  sets  [x]  =  xS  and 
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Eff  =  e9S  can  be  calculated  from  the  equation 


min  d(a,0)  = 

atrS 

fitt.S 


min 

atrtJlS. 

fits. 


d(a,0). 


Proof  of  Lemma  A.3:  By  Lemma  A.2  and  the  right  invariance  property, 


min  d(a,0)  = 

ocirS 

fitt.S 


min  d(v5i,e932)  =  mind(?re, e„) 
«1,«3  tS  *tS 


—  mind(jrserl,e)  =  mindfrrel^s.e)  = 

•tS  ttS. 


min  d(a,{3). 

atxe~lS. 

fits. 


□ 


The  motivation  behind  Lemma  A.3  is  that  we  already  know  how  to  find  the  mini¬ 
mum  interpoint  distance  between  the  sets  ire~lS9  and  Sa,  since  Theorem  3  calculates  the 
distance  between  two  sets  of  precisely  this  form. 

The  test  statistics  of  Theorem  4  follow  easily.  For  example,  to  find  the  statistic  induced 
by  the  metric  d  —  K,v/t  compute 

min  K(a,0)  =  ^  #{(t,  m)cNj  x  Nk:  re~l(i)  > 


<*««■«” 1 S. 

fits. 


3<k 


-  53#{(l’*m)€e<r(^C;))  x  t„{N9[ k))\  ne^{i)  > 


=  £#{(■'  ,m)eN<r(J)  x  NffW:  ir(i)  >  ir (m)}, 


and  so  the  statistic  induced  by  K  is 


mm  min  K{ctJ)  =  min  x  N*W'  »(t)  >  *■("»)}•  □ 

<rtsT  at*t~lS.  °ts. 

fits. 


°tSr  f 
}<* 


Proof  of  Theorems  5  and  6:  Two  new  properties  possessed  by  some  metrics  on 
permutations,  the  “first  and  second  partition  properties,”  are  fundamental  to  deriving  the 
test  statistics  for  the  two-way  layout. 


44 


Definition:  Let  d  be  a  metric  on  Sn  for  each  n  >  1.  (More  precisely,  d  is  a  collection 
of  metrics,  defined  on  Sn  for  each  n  >  1).  Consider  an  arbitrary  partitioning  of  the  set 
{l, . . . ,  n}  into  two  sets  Y  and  Z.  Let  y2  <  . . .  <  yr,  and  z\  <  . . .  <  zTa  be  enumerations 
of  Y  and  Z,  respectively,  where  r\  =  #y,  r2  =  t\  +  r2  =  n.  For  each  ft  eSn  define 

jry  c5ri  and  ft  %  e  Sra  by 

fty(j)  =  #{*  «  Y:  ft[k)  <  7r(yy)>  for  j  = 

*\z(/)  =  #{*  c  »(*)  <  »fo)}  for;  * 

If  the  preceding  conditions  imply  that 

d(7r,o)  >  d(7Ty,oy)  +  d(ft2taz)  for  all  :r,o  e  Sn, 

then  the  metric  d  is  said  to  possess  the  first  partition  property.  Further,  if  the  additional 
conditions 

Vi  <  Zj  for  allt,/, 

*(y»)  <  »(*/)  for  aU  *»/» 

o(y,)  <  a{zj)  for  all  :,/ 

imply  that  equality  is  attained  above,  i.e.  d[ft,c)  —  d(7ry,oy)  +  d(7r^,o^),  then  the  metric 
d  is  said  to  possess  the  second  partition  property. 

The  importance  of  the  partition  properties  is  due  to  the  following  observation  (the 
statement  and  proof  of  which  use  the  notation  of  Sections  8  and  9). 

Proposition  A. 4:  If  the  metric  d  possesses  both  the  first  and  second  partition 
properties,  then  the  induced  test  statistic  for  the  two-way  layout  with  ordered  alternatives 
is  the  sum 

b 

<%!.[«])  *  13  *(*»'«*)» 

i=i 
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and  the  induced  test  statistic  for  the  two-way  layout  with  unordered  alternatives  is 


b 

d([ir],  Sr  ©  [«])  =  mm  V  *(*«  *)• 

fftSr  7™ 
t»l 


Proof:  The  test  statistic  for  the  ordered  alternatives  case  is  <£([*■],  [e])  =  min  d(a,/3). 

culir] 

M 

b  b 

The  first  partition  property  implies  that  d(a, /?)  >  =  £1  c^(7r») c)  f°r  all 

»=i  i=l 

a  c  [*-],/?  c  [«].  On  the  other  hand,  if  we  consider  the  particular  permutation  olq  e  [tt] 
defined  by 


a0((»-l)r  +  j)  =  (t  -  l)r  +  7r,(j)  V  t  =  1, . . .  ,6  V  j  =  1,. ..  ,r, 

and  if  we  take  0o  =  e  e  [e],  then  the  second  partition  property  implies  that  the  preceding 
bound  is  actually  attained: 

t  b 

d[aOj0 o)  =  ^2  d(otoi,Poi)  —  d(’r*»e)- 

**»i  **i 

b 

Hence  min  d(a,P)  =  d(7r,-,e),  as  claimed. 

For  the  case  of  unordered  alternatives,  the  set  of  extremal  permutations  is  E  =  (J  Ec, 

fftSr 

where  E9  =  {r  e  Sn:  *  a  V  *  =  1, ...  ,6}.  For  each  a  e  Sr  let  e  Ea  be  the  permutation 
defined  by  e<r((t  —  l)r  +  j)  =  (t  —  l)r  +  a{j)  for  all  t  —  1, . . . ,  b  and  j  =  1, . . . ,  r.  The  test 
statistic  for  the  two-way  layout  with  unordered  alternatives  is 

<£((*■],£)  =  min  d[\n],E9) 

=  min  <*([*•],  [e*]) 

<TtSr 

=  min  cf([7r]c(r-i ),  ) 

<7tSr 

=  min  ^([Tre^-i ]),  [c]). 


By  the  result  proved  in  the  first  part  of  this  proposition  (the  ordered  alternatives  case), 
this  last  minimum  equals 

6  b  b 

mm  y'd((?rey- »),-,«)  =  min  y'd(?r,a“1,e)  =  min  d[irita).  □ 

<?tSr  <T€Sr  <?tSr  . 

«=i  1=1  t=i 

Theorems  5  and  6  will  now  follow  directly  from  the  following  fact. 

Proposition  A. 5:  The  metrics  K,  F ,  R2,  and  U  all  possess  both  the  first  and 

second  partition  properties. 

Proof:  The  proof  that  these  metrics  possess  the  second  partition  property  is  very 
straightforward,  and  is  omitted. 

To  show  that  K  satisfies  the  first  partition  property,  we  need  only  observe  (using  the 
notation  in  the  definition  of  the  partition  property)  that 

K(ir,a)  =  #{(t\j)«{l,...,»}2:  *(»)  <  tt(j)  and  a(t’)  >  o(j)} 

>  #{(»,  j)tY  x  Y:  7r(t)  <  ir(j)  and  o(t')  >  <r(j)> 

+  #{(*,/)«£  x  2:  ?r(*)  <  ir{j)  and  ff(t')  >  a[j)} 

-  ••»rx}2:  *V(0  <  *y(j)  and  cry(i)  >  ay (.?)} 

+  #{(».i)«{l»****r2>25  *z{')  <  *z{j)  and  az(i)  >  az[j )} 

=  K(ity,^y)  +  K{‘KZ,az). 

To  prove  the  first  partition  property  for  U,  let  1 1 , . . .  ,»m  be  a  maximal  collection  of 
items  ranked  in  the  same  relative  order  by  ir,o  e  Sn.  Let  yy, , . . . , yJp  and  Zfc,,...,Zfcf 
be  enumerations  of  the  sets  {t'i,...,tm}  n  Y  and  {ti,...,*m}  n  Z ,  respectively,  where 
p  +  q  —  m.  Then  yy, , . . . ,  y]p  are  ranked  in  the  same  relative  order  by  n  and  a,  and  hence 
ji, ...,jp  are  ranked  in  the  same  relative  order  by  7Ty,oy  e  STl.  Similarly  ku...,kq  are 
ranked  in  the  same  relative  order  by  nz,<^z  £  Sr3-  It  follows  that 

U[ir,o)  =  n  -  m  =  (rx  -  p)  +  (r2  -  g)  >  U{irY,aY)  +  U(nz,cz). 


We  sketch  the  main  ideas  of  the  proof  of  the  first  partition  property  for  F ,  which  is 
relatively  difficult.  The  proof  is  by  induction  on  n.  Assume  the  property  is  true  for  »  —  1, 
and  let  n,<r  c5n  be  arbitrary.  Let  me{l,...,n}  minimize  |tt(-)  —  o(-)|.  Without  loss  of 
generality  we  assume  m  e  Y,  and  so  m  =  ylm  for  some  index  tm.  Let  r\o'  eSn-i  be  the 
permutations  obtained  by  deleting  m,  i.e. 


{?r(i)  if  i  <  m  and  ir(t)  <  r(m) 

x(i)  —  1  if  i  <  m  and  ?r(t)  >  7r(m) 

ir(i  +  1)  if  t  >  m  and  x (i  +  1)  <  x (m) 

7r(t  -f  1)  —  1  if  »  >  m  and  ?r(i  +  1)  >  x(m) 


and  similarly  for  a'.  Finally,  let  Hi  =  {»  =s  l,...,n:  x(t)  <  x(m)  and  er(i)  >  <r(m)},  let 
n2  =  {t  =  l,...,n:  jr(t)  >  7r(m)  and  cr( t)  <  o(m)},  and  let  0  =  n i  U  Q2. 


It  is  straightforward  to  verify  the  following  four  equations: 

F{itz,oz)  =  F{ir'z,ff'z), 

F(x,<r)  =  F(x',<r')  +  +  |jr(m)  - 

F(jTy,Cy)  <  F(7Ty,Cy)  +  #(0  nK)  +  |?Ty(tm)  —  Oy 
ky(*m)  -  *y(*m)|  =  |*(m)  -  o(m)  -  (#(nx  n  Z)  -  #(n2  n  Z))\ 

<  |?r(m)  -  o(m)j  +  #(H  n  Z). 

From  these  equations  and  the  inductive  assumption,  we  obtain 
.F(7r,<r)  =  F{x',o')  +  +  |ir(m)  -  a(m)| 

>  F{*'Yte'Y)  +  F(*’zte'z)  +  #(H  n  Y)  +  #(fl  n  Z) 

+  I*Y  (t’m)  -  ^y  (t'm)l  -  #(0  n  Z) 

>  F(ty,cy)  +  F(xz ,cz), 

as  was  to  be  shown. 

The  proof  that  R 2  possesses  the  first  partition  property  follows  essentially  the  same 


lines  as  the  preceding  proof  for  F,  and  is  omitted.  □ 


.VJfrl 


A 


Proof  of  Theorems  7  and  8:  These  two  theorems  present  the  test  statistics  for  the 
one-sample  location  problem,  with  ordered  and  with  unordered  alternatives,  respectively. 
We  will  use  the  notation  of  Section  10. 

For  the  case  of  ordered  alternatives  (Theorem  7),  the  test  statistic  is  d(xSni  x  Si  x 
Sft„5ni  x  Si  x  Sn,).  Therefore,  this  statistic  can  be  inferred  directly  as  a  special  case  of 
Theorem  3,  where  the  statistic  d(irSni  x  ...  x  Sn, ,  Sn,  x  ...  x  S„r )  was  derived.  Thus, 
although  Theorem  3  deals  with  a  different  testing  situation  (the  multi-sample  location 
problem  with  ordered  alternatives),  the  two  statistics  are  algebraically  equivalent. 


As  an  example,  Theorem  3  implies  that  the  one-sample  location  statistic  induced  by 
the  metric  F  is  F(7rSni  x  Si  x  5ni,5n,  x  S\  x  SnJ  =  la«  ~  *1-  It  is  now  simply  a 
matter  of  adapting  to  the  notation  of  Section  10,  to  obtain  the  form  of  this  statistic  given 
in  Theorem  7.  Indeed,  for  the  one-sample  location  problem,  one  has  an+i_,  =  n  +  1  —  a,, 
and  so  Ylimi  la»  “  *1  =  2Hr*i(a»  “  *)  =  (2!Cr*i  *(0)  ~  ni(ni  +  !)•  Moreover,  from  the 


definition  of  it*, 


/»•(«) +  ifX,>0 
W  “  \  Sfl -»•(»•)  ifX,  <0, 


and  so  the  preceding  statistic  becomes 

_  \  ^  /  « /  >\  ^  i  \  - 


2  (**(0  +  “y”)  +  2  J2  ”**(»)) -n^ni  +  i) 

{ilNr.  Xi>0)  {UN , :  x,  <o) } 


=  2n\ 


n  +  1 


~  ni(ni  +  1)  -1-2  7r*(l)  ~  2  ^2  **(*) 


{UN,:  Xi>0) 


{♦tN,:  X,<0} 


-»,(»,  +  l)  +  *  Z  E  '‘('I 

{t'tN, :  Xi  >0}  l  {itNii  Xi< 0)} 

=  4  E  »'(o. 

{UN,:  X<>0} 

which  is  equivalent  to  the  Wilcoxon  signed  rani  statistic,  as  claimed  in  Theorem  7. 


J2  *‘(0 

{UN,:  X.<0)} 


For  the  case  of  unordered  alternatives  (Theorem  8),  the  test  statistic  is  d(7rSni  x  Si  x 
Sn  ,S2  ©  Sni  x  Si  x  SnJ.  This  statistic  is  most  easily  derived  by  noting  that  the  set  of 


extremal  permutations 


.Sj  ©  Sni  x  Sj  x  Sr ij  —  (5»i|  x  S i  x  Sni )  U  (cSn,  x  S\  x  Sni )  —  S  U  cS, 

where  S  =  Sni  x  Si  x  Sn, ,  and  where  e  £  Sn  is  defined  by  e(»)  =  n  4-  1  —  t,  for  i  =  1, . . . ,  n. 
Therefore  the  test  statistic  d(irS,  S2  ©  S)  equals  min{d(7rS,  S),d(irS,  tS)}. 

The  distance  d(7rS, S)  was  computed  in  Theorem  7.  To  evaluate  d(?rS, eS),  we 
first  note  that  c-1  =  c  and  hence  Sc-1  =  St  =  IS  (because  all  of  these  cosets  equal 
{rtSn:  t(N  1)  =  i^3,r(jV2)  =  N2,t(Nz)  =  N\})-  Therefore  (also  using  the  right-invariance 
property)  d{nS,  eS)  =  d(7cS,Si)  =  d(7rSc-1,S)  =  d(iriS,S).  This  last  distance  d(niSyS) 
can  be  found  directly  from  Theorem  3  (or  Theorem  7). 


As  an  illustration,  the  statistic  induced  by  F  (for  unordered  alternatives)  will  now 
be  calculated  explicitly.  This  statistic  is  min{d(jrS,S),d(7reS, S)},  which,  by  Theorem  3, 


n  n 

equals  min{£  |a»  —  *|,  Yi  |®»  ~  *|}>  where  ai  <...<«»,  is  an  enumeration  of  the  set 

ixl  i—1 

7r(iVi),  an,+ 2  <  ...  <  an  is  an  enumeration  of  Nz),ai  <  ...  <  an,  is  an  enumeration 


of  xe(iV'i), a„1+2  <  ...  <  an  is  an  enumeration  of  irt{Nz),  and  where  an,+i  =  7r(nx  +  l), 
an,+i  =  7re(ni  +  l).  For  the  one-sample  location  problem,  it  is  easy  to  verify  that 


a,  =  < 


®nj  +  l+t  for  i  —  1,  .  .  .  ,  Tl  1 

a,-  =  nx  + 1  for  i  =  rii  +  1 
(  ai_n,_i  for  *  =  rii  +  2, ... , n* 


The  test  statistic  is  thus 


"i 


i=l 


=  min{2^(oi-i'),2  J3(ani+i+t-  -1')} 

«=i  »=i  »=i 

Ttt  nj 

=  min{2^(at-  -  1),  2ni(ni  +  1)  -  2  ^(a,-  -  *)} 


i= 1  i=l 

=  min{4  ^  ir*(i),  2n1(n1  + 1)  -  4  ^  *’(*)} 

{itNf.  Xi> 0  *.>0} 


(The  next  to  last  equality  uses  Theorem  7.)  This  is  equivalent  to  the  two-sided  Wilcoxon 
signed  rank  statistic,  as  claimed  in  Theorem  8.  □ 

Proof  of  Theorems  9  and  10:  These  theorems  present  the  test  statistics  for  the 
two-sample  dispersion  problem,  with  ordered  and  unordered  alternatives,  respectively.  The 
derivations  of  these  statistics  will  make  use  of  all  of  the  general  metric  properties,  which 
we  have  thus  far  considered:  the  right-invariance  property,  the  transposition  property,  and 
the  first  and  second  partition  properties. 

We  will  use  the  notation  of  Section  11,  and  let  5  denote  the  subgroup  Sni  *  Sn3 
described  there.  For  simplicity,  we  assume  (as  in  Section  11)  that  the  sample  sizes  ni  and 
n2  axe  both  even  numbers. 

For  the  ordered  alternatives  problem,  the  extremal  set  is  E  =  TipS  =  t/»S,  and  so  the 
test  statistic  is 

d(TirS,^S)  =  d(TjrS,t/>)  =  min^  d[a, 

(by  right-invariance).  To  derive  this  statistic  we  must  therefore  construct  a  “minimizing” 
permutation  aoeTnS  such  that  d(a o,  VO  =  min atTirs  d(a,  t/>),  and  then  we  must  compute 
d(a0,^). 

It  is  straightforward  to  derive  the  statistic  for  the  metric  d  =  H.  Indeed,  let 
mi  =  #{»  e  Ni:  7r(:')  <  y  or  *(*)  >  n  +  1  -  y }  =  #[ir{Ni)  n  ip{Ni))\ 
we  must  show  that 

min  H(a,  0)  =  2nx  —  2mi. 

aiTirS 

Let  m2  =  #(^(^2)  n  rp(N2))  —  mj  -t-  n2  —  nx.  For  j  =  1,2,  let  c;  J, . . . ,  c;m>  be  an 
enumeration  of  the  set  ir(Nj)  n  4>(N:).  Choose  any  cto  €  S  such  that  for  all  j  and  k, 
cr0(^-1(cjfc))  =  7r_1(c;fc).  Let  q0  =  7 ro0  f  T-nS.  Then  Q0(t/>-1  (c;Jt))  =  cjk  =  ^(0-1(c;fc)), 
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and  hence  H(ao,i/j)  =  n  —  #{i  =  1 . ,n:  ao(t’)  —  V’(t)}  —  n~  (mi  +  m2)  =  2nx  —  2m\. 

Thus  min  H(a ,  i/>)  <  2nx  —  2mx.  To  prove  the  reverse  inequality,  we  simply  observe  that 
atT*S 

for  any  a  =  riro  e  TnS, 

2 

H(Tircr,4>)  =n-^2  #{*  «  =  ^(0) 

y=i 

>»- X>(™W)  n  *(*>)) 

;=i 

«»-£ #(*- (JV»  n *(JVy)) 
y=i 

=  n  —  (mi  -I-  m2) 

=  2nx  —  2mx. 

To  derive  the  statistics  induced  by  .fif,  F,  and  1Z2,  we  need  the  following  result,  the 
proof  of  which  uses  the  fact  that  these  three  metrics  satisfy  both  the  transposition  property 
and  the  first  and  second  partition  properties. 

Lemma  A.4:  For  d.  =  K,  F,  or  iZ2,  the  permutation  a0  e  TitS  which  minimizes 
d(-,0)  may  be  chosen  so  as  to  satisfy  the  following  constraints: 

#(ao(Wl)n{l  = 

ao(l)  <  . ..  <  ao(ni), 

Qo(ni  +  !)<...<  C*o(nl  +  n2)- 


Proof  of  Lemma  A.4:  Let  a  e  TnS  be  arbitrary.  Let  a'  =  ac  (  TnS,  where  o  t  S 
is  chosen  so  that 

a'(l)<.. .<«>,),  a'(AT,)  =  a(JVx), 


a'(ni  +1)  <  ...  <  a'(n),  =  0(^2)- 
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Then  d(a,rj))  >  d(a',ii>),  by  the  transposition  property  (since  a'  can  be  obtained  from  a 
by  a  sequence  of  transpositions  which  satisfy  the  defining  conditions  of  the  transposition 
property) . 

Next  we  partition  {l, . . . ,  n}  into  the  two  sets  Y  =  {1, . . . ,  n/ 2}  and  Z  —  {n/2  + 
1, . . . ,  n}.  As  defined  in  the  proof  of  Theorems  5  and  6,  let  aY  and  a!z  be  the  permutations 
induced  by  a!  for  these  two  sets,  and  similarly  define  Then  by  the  first  partition 

property,  d(a',V»)  >  d(a'Y ,^y)  +  d(a'z,^z). 

Next  let  a "  —  ra!  e  TirS,  where  t  e  T  is  the  composition  of  |ni/2  —  #(a'(Ni)  n 
{l,...,n/2})|  transpositions,  chosen  so  that  #(a"(JVi)  fl  {l,...,n/2})  =  ni/2.  Let  oto  = 
a"oo  e  TirS ,  where  cr0  e  S  is  chosen  so  that 

a0(l)  <  . . .  <  ^(JVO  =  a"(JVx), 

t 

ttof11!  +  l)  <  • .  •  <  C*o(n2)  =  2). 

Then  a0  also  satisfies  the  constraint  #(ao(JVi)  f>  {l,...,n/2})  —  nx/2.  Moreover,  a  re¬ 
peated  application  of  the  transposition  property  shows  that  d(a'y ,  V>y)  >  d(a:oy,  0y)  and 
d(a'z,ij>z )  >  d{a0z,ipz)>  and  hence 

<2(<*yi^y)  +  d(az,rl>z)  >  d(aoy  ,rjiy)  +  d(acoz,  'I’z)- 

Finally,  by  the  second  partition  property, 

d(aoy,  rpy )  +  d(aoz,^z)  =  d(a0,  4>). 

Putting  together  all  of  the  pieces,  we  obtain  the  desired  result: 
d(a,  ip)  >  d(a\  ip)  (by  the  transposition  property) 

>  d(ay,^y)  +  d(a'z,ipz)  (by  the  first  partition  property) 

>  d(aoy>0y)  +  d(<*oz,^z)  (by  the  transposition  property) 

=  d[a o,ip)  (by  the  second  partition  property).  □ 
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The  preceding  lemma,  is  the  key  step  in  deriving  the  statistics  induced  by  K,  F,  and 
R2 .  Indeed,  for  d  =  K  or  F,  it  is  relatively  easy  to  verify  that  all  permutations  a0  £  Tit S 
which  satisfy  the  conditions  of  the  lemma  will  produce  the  same  value  of  d(a0, 0),  namely, 
the  value  given  in  the  statement  of  Theorem  9.  The  details  are  omitted. 


For  d  =  R2,  the  minimizing  permutation  ao  may  be  constructed  as  follows.  Recall 
(from  Section  11)  that  by, . . . ,  6„,  is  an  enumeration  of  n{Ni)  such  that 

l*« -^12 -2  I*.. -^1- 

The  minimizing  ao  satisfies  the  conditions  of  the  lemma,  plus  the  additional  constraints: 


n  +  1 


Oq  (6,-)  e  Ny  whenever  either  i  is  odd  and  6t-  < 

,  ,  .  ft  +  1 
or  x  is  even  and  bi  >  — - — : 

2 

ao  *(n  +  1  —  M  «  Ni  whenever  either  i  is  odd  and  b{  > 

4 


or  i  is  even  and  6,-  < 


n  + 1 


(Intuitively,  this  says  that  the  item  from  the  first  population  whose  rank  is  furthest  from 
the  median  is  placed  to  the  left  of  the  median,  the  next  furthest  is  placed  to  the  right  of 
the  median,  and  so  on.  Of  course  the  items  within  each  population  must  then  be  relabeled 
to  ensure  that  ao(l)  <  ...  <  ao(n;)  and  ao(ni  +!)<...<  ao(n).) 


The  proof  is  by  contradiction.  If  ao  does  not  satisfy  these  conditions,  one  can  find 
a  transposition  r  e  T  and  a  “relabeling  permutation”  a  e  S  such  that  jR2(rao<r,  0)  < 
R2{ ao,0).  The  details  are  omitted. 


For  the  case  of  unordered  alternatives  (Theorem  10),  the  test  statistic  is 
d{TnS,TS2  ©  05)  =  m\n{d(TirS,TrpS),d(TTrS,  TrpS)}, 

where  0  is  the  “other  type  of  extremal  permutation”: 

{*  +  ^  for  x  =  1, ...,**! 
i  —  Tiy  for  *  *=  ni  +  1, . . . ,  nj  +  2* 
t  for  x  =  n-!  +  Tp-  +  1,. . .  ,ni  +  n2. 
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The  distance  d(TirS,TipS)  was  found  in  Theorem  9.  On  the  other  hand,  it  can  also  be 
shown  that  computing  d{TitS,T^S)  again  reduces  to  Theorem  9.  (Intuitively,  one  need 
only  relabel  the  two  populations.)  The  details  of  the  derivation  are  similar  to  the  details 
for  the  unordered  alternatives  cases  in  Theorems  2,  4,  6,  and  8,  and  are  omitted.  □ 

Proof  of  Theorem  12:  This  theorem  asserts  that 

W(ir,o)  =  #{t  =  l,...,n  —  1:  7r<r-1(t  +  l)  #  jr<7-1(t)  +  1} 

is  a  metric  on  5n,  and  that  it  induces  the  Wald-Wolfowitz  statistic  for  the  two-sample 
location  problem  with  unordered  alternatives.  To  prove  it,  we  use  the  notation  of  Sections 
4,  5,  and  13.3. 


We  first  show  that  W  is  a  metric  on  Sn.  Since  W  is  obviously  right-invariant,  to  check 


that  W[ir,a)  =  W(a,  7r),  it  is  enough  to  show  that  W(ir,e)  =_W(e,?r).  For  any  ir  e  Sn 


define  It  e  Sn+1  by 


Then 


f  3r(»)  for  t  =  1, . . . ,  n 
\  n  +  1  for  *  =  n  +  1. 


=  #{•' 

=  1,.. 

. ,  n  -  1:  r(i  +  l)  #  x(i)  -f  1} 

II 

=  1,.. 

•  >  ^(*  4*  l)  *(»)  +  1}  ~  l{ir(n)^n} 

=  #{*' 

=  1,.. 

.  ,n:  *  +  1  #  *“l(»(0  +  1)}  ”  1{*-1  (»)/„} 

=  #{*' 

=  1,.. 

.  ,n:  »_1(t)  +  1  #  f-1(t  +  1)}  -  l{«-i(n)#n} 

=  #{* 

=  1... 

•  ,n  —  1:  Jr"x(i)  +  1  ^  7r_1(t  +  1)} 

=  W(t 

as  claimed.  (The  fourth  equality  uses  the  fact  that  as  i  ranges  from  1  to  n,  so  does  ff-1(t).) 


The  other  metric  property  which  is  non-trivial  to  verify  is  the  triangle  inequality.  By 
right-invariance,  this  reduces  to  showing  W(7r,o)  <  W(n,c)  4-  W(eya).  We  observe  that 


for  *  =  —  1,  if  both  c~1(i  +  1)  =  +  1  and  Jr(<r_1(*)  +  1)  =  ?r(<y”1(0)  +  1» 

then  7ra_1(t  +  1)  =  ira”^*)  +  1.  Hence 

W{n,o)  =  #{t  =  -  1:  +  1)  #  +  1} 

<  #{t  =  1  —  1:  +  1)  #  +  1} 

+  #{*  =  - 1:  jr(a_1(t)  +  l)  ^  Jr(<7_1(t))  +  1  and  <r-1(t  +  1)  =  o”l(t)  +  1} 

<  W(etff)  +  #{i  —  —  1:  jr(a_1(t)  +  1)  #  7r(<T_1(t))  +  1  and  <7-1(t)  <  n  —  1} 

<  W (e,o)  +  #{t  =  1, . . . ,  n  -  1:  jr(i  4-  l)  ^  ?r(j)  +  1} 

=  W(e,c)+W{i r,e), 

as  desired. 


Finally,  we  show  that  W  induces  the  Wald-Wolfowitz  “runs  test”  statistic,  for  the 
two-sample  location  problem  with  unordered  alternatives.  The  statistic  induced  by  W  is 


W(rSni  x  Sna,S2  ©  5ni  x  SnJ  *  min{  min  W{a,e),  min  W{a,e )}, 


atwS»t  xS», 


aeirS,,  xS., 


where 


f  *  -r  * 2  *  *  1, . . . ,  »i 

— »i  *  »nx  +  l,...,nx  + 


To  find  min  W[a,c),  let  X’s  and  0’s  denote  items  from  populations  1  and  2, 


otcirSfi]  x5^j 


respectively.  In  the  ranking  7r,  let  us  say  that  “an  X  is  followed  by  an  0”  whenever  an 
item  from  population  1  is  followed  by  an  item  from  population  2;  and  let  us  say  that  “an 
X  followed  by  a  space”  occurs  if  the  item  assigned  the  rank  n  belongs  to  population  1. 


We  notice  first  that 


min  W(a,  e)  >  #  (0’s  followed  by  an  X  or  a  space)  —  1 

atrSni  x5Bq 

+  max  {0,#(X's  followed  by  an  0)  -  1} 
+  #(Af’s  followed  by  a  space). 
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Indeed,  any  O  followed  by  an  X  or  a  space  must  contribute  1  to  W(a,  e),  unless  it  happens 
to  be  item  rij  +  n^.  Similarly,  any  X  followed  by  an  O  must  contribute  1  unless  it  is  item 
«i;  and  any  X  followed  by  a  space  contributes  1. 


On  the  other  hand,  we  can  construct  ao  £  jrSn,  x  Sni  which  actually  attains  the 
preceding  lower  bound,  as  follows.  Let  ai  <  ...  <  an,  and  ani+1  <  •••  <  an,+na  be  the 
ranks  assigned  by  x  to  populations  1  and  2,  respectively.  Choose  any  pair  (j,  k)  e  Ni  x 

such  that  a/c  =  ay  +  1  (i.e.  choose  any  X  followed  by  an  O).  Let 

'  ®*+y  t  =  i,...,ni -y 

ao(t)  =  <  a'+3~ni  t  =  n i  —  j  +  1, . . . ,  Hi 

a»+*-n  i  -i  *  —  n.i  +  1, . . . ,  ni  +  n  —  k  +  1 

^  a*+fc-n-i  *  =  ni  +  n  —  k  +  2, . . . ,  rij  +  ri2. 

(If  no  such  pair  (j,k)  exists,  just  let  cto(i)  s  a,-.)  It  is  easy  to  check  that  ao  *  x  Sna, 


and  that 


W(a0,e)  =  #  (O’s  followed  by  an  AT  or  a  space)  -  1 
+  max  {0,  #pf’s  followed  by  an  O)  —  1} 
+  #(JTs  followed  by  a  space). 


Hence  min  W (a,  e)  is  the  right  hand  side  above.  By  a  similar  argument, 


at  €  jtSrj  xSn<2 


min  VP(a,  <)  =  #[X}s  followed  by  an  O  or  a  space)  —  1 

®  c  ir5nj 

+  max  {0,  #(0’s  followed  by  an  X)  —  1} 


+  #(0’s  followed  by  a  space). 


It  follows  that 


lV(?r5Bl  x  Sn,,5j  O  Sn,  x  S„9)  =  min{  min  W(a,e),  min  lV(a,e)} 

atrSni  xS»a  atwSKlxSni 


=  #(AT’s  followed  by  an  O )  +  #(0’s  followed  by  an  Jf)  —  1 
=  #(runs  in  it)  —  2, 


which  is  equivalent  to  the  Wald-Wolfowitz  statistic,  as  claimed.  □ 
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